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Abstract 

Graphical model processing is a central problem in artificial intelligence. The optimiza¬ 
tion of the combined cost of a network of local cost functions federates a variety of 
famous problems including CSP, SAT and Max-SAT but also optimization in stochastic 
variants such as Markov Random Fields and Bayesian networks. Exact solving methods 
for these problems typically include branch and bound and local inference-based bounds. 
In this paper we are interested in understanding when and how dynamic programming 
based optimization can be used to efficiently enforce soft local consistencies on Global 
Gost Functions, defined as parameterized families of cost functions of unbounded ar- 
ity. Enforcing local consistencies in cost function networks is performed by applying 
so-called Equivalence Preserving Transformations (EPTs) to the cost functions. These 
EPTs may transform global cost functions and make them intractable to optimize. We 
identify as tractable projection-safe those global cost functions whose optimization is and 
remains tractable after applying the EPTs used for enforcing arc consistency. We also 
provide new classes of cost functions that are tractable projection-safe thanks to dynamic 
programming. We show that dynamic programming can either be directly used inside 
filtering algorithms, defining polynomially DAG-filterable cost functions, or emulated 
by arc consistency filtering on a Berge-acyclic network of bounded-arity cost functions, 
defining Berge-acyclic network-decomposable cost functions. We give examples of such 
cost functions and we provide a systematic way to define decompositions from existing 
decomposable global constraints. These two approaches to enforcing consistency in global 
cost functions are then embedded in a solver for extensive experiments that confirm the 
feasibility and efficiency of our proposal. 
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1. Introduction 


Cost Function Networks (CFNs) offer a simple and general framework for modeling 
and solving over-constrained and optimization problems. They capture a variety of prob¬ 
lems that range from CSP, SAT and Max-SAT to maximization of likelihood in stochastic 
variants such as Markov Random Fields or Bayesian networks. They have been applied 
to a variety of real problems, in resource allocation, bioinformatics or machine learning 
among others [U |6l|, IH, 64, 2, 371. 

Besides being equipped with an efficient branch and bound procedure augmented with 
powerful local consistency techniques, a practical CFN solver should have a good library 
of global cost functions to model the often complex scenarios in real-life applications. 

Enforcing local consistencies requires to apply Equivalence Preserving Transforma¬ 
tions (EPTs) such as cost projection and extension 23|. Most local consistencies re¬ 


quire to compute minima of the cost function to determine the amount of cost to 
project/extend. By applying these operations, local consistencies may reduce domains 
and, more importantly, tighten a global lower bound on the criteria to optimize. This 
is crucial for branch and bound efficiency. Global cost functions have unbounded arity, 
but may have a specific semantics that makes available dedicated polynomial-time algo¬ 
rithms for minimization. However, when local consistencies apply EPTs, they modify 
the cost function and may break the properties that makes it polynomial-time minimiz- 
able. We say that a cost function is tractable if it can be minimized in polynomial time. 
The notion of tractable projection-safety captures precisely those functions that remain 
tractable even after EPTs. 

In this paper, we prove that any tractable global cost function remains tractable after 
EPTs to/from the zero-arity cost function (H^), and cannot remain tractable if arbitrary 
EPTs to/from r-ary cost functions for r > 2 are allowed. When r = 1, we show that 
the answer is indefinite. We describe a simple tractable global cost function and show 
how it becomes intractable after projections/extensions to/from unary cost functions. 
We also show that flow-based projection-safe cost functions [i^ are positive examples of 
tractable projection-safe cost functions. 

For r = 1, we introduce polynomially DAG-filterable global cost functions, which can 
be transformed into a filtering Directed Acyclic Graph with a polynomial number of 
simpler cost functions for (minimum) cost calculation. Gomputing minima of such cost 
functions, using a polynomial time dynamic programming algorithm, is tractable and 
remains tractable after projections/extensions. Thus, polynomially DAG-filterable cost 
functions are tractable projection-safe. Adding to the existing repertoire of global cost 
functions, cost function variants of existing global constraints such as Among, Regular, 
Grammar, and Max/Min, are proved to be polynomially DAG-filterable. 

To avoid the need to implement dedicated dynamic programming algorithms, we 
also consider the possibility of directly using decompositions of global cost functions 
into polynomial size networks of cost functions with bounded arities, usually ternary 
cost functions. We show how such network-decompositions can be derived from known 
global constraint decompositions and how Berge-acyclicity allows soft local consistencies 
to emulate dynamic programming in this case. We prove that Berge-acyclic network- 
decompositions can also be used to directly build polynomial filtering DAGs. 

To demonstrate the feasibility of these approaches, we implement and embed various 
global cost functions using filtering DAG and network-decompositions in toulbar2, an 
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open source cost function networks solver. We conduct experiments using different bench¬ 
marks to evaluate and to compare the performance of the DAG-based and network-based 
decomposition approaches. 

The rest of the paper is organized as follows. Section [5] contains the necessary back¬ 
ground to understand our contributions. Section [3] analyses the tractability of enforcing 
local consistencies on global cost functions and characterizes the conditions for preserving 
tractability after applying EPTs. In Section |4] we define DAG-filtering and in Section [5] 
we give an example of a polynomial DAG-filterable global cost function. Sections [5] and 
[3 present network-decomposability and the conditions for preserving the level of local 
consistency. Section [5] shows the relation between network-decompositions and DAG- 
filtering. Section [3] provides an experimental analysis of the two approaches on several 
classes of problems. Section ITOl concludes the paper. 


2. Background 

We give preliminaries on cost function networks and global cost functions. 

2.1. Cost Function Networks 

A cost function network (CFN) is a special case of the valued constraint satisfac¬ 
tion problem with a specific cost structure ([0,..., T], ©, <). We give the formal 
definitions of the cost structure and GFN as follows. 

Definition 1 (Gost Structure [i^)- The cost structure ([0,..., T], ©, <) is a tuple de¬ 
fined as: 

• [0,..., T] is the interval of integers from 0 to T ordered by the standard ordering 
<, where T is either a positive integer or -|-oo. 

• © is the addition operation defined as a (B b = min(T, a + 6). We also define the 
subtraction 0 operator for any a and b, where a > b, as: 

\ T, otherwise 


Note that more general additive cost structures have also been used. Specifically, VAG 
and OSAC [22| local consistencies are defined using a structure using non-negative ratio¬ 
nal instead of non-negative integer numbers. For ease of understanding, our discussion 
assumes integral costs. However, it can easily be generalized to rational costs. 


Definition 2 (Cost Function Network [i^). A Cost Function Network (CFN) is a tuple 
(A,W,T), where: 


m X is an ordered set of discrete domain variables {xi,X 2 ,. ■. ,Xn}. The domain of 
Xi € X being denoted as D(xi); 

• W is a set of cost functions Ws each with a scope S = {a^sj,..., Xs.^} Q W that 
maps tuples i € , where = D{xsi) x • • • x D^Xs^), to [0,..., T]. 
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When the context is clear, we abuse notation by denoting an assignment of a set of 
variables S' C df as a tuple £ = {vg^, ■ ■ ■ ,Vs^) € The notation £[xsi] denotes the 
value Vsi assigned to Xsi in £, and £[S'] denotes the tuple formed by projecting i onto 
S' C S. Without loss of generality, we assume W = {W 0 } U {Wi \ Xi £ X} U W'*'. W 0 
is a constant zero-arity cost function. Wi is a unary cost function associated with each 
Xi G X. is a set of cost functions Ws with scope S and |S| > 2. If W 0 and {Wi} are 
not defined, we assume Wi{v) = 0 for all v G D{xi) and W 0 = 0. To simplify notation, 
we also denote by Ws-i,s 2 ,...,sr the cost function on variables {xg^^Xg^,... ,^8^} when the 
context is clear. 

Definition 3. Given a CFN {X, W, T), the cost of a tuple £ G is defined as cost{£) = 
®Ws&w ^ tuple £ G is feasible ifcost{£) < T, and it is an optimal solution 

of the CFN if cost(£) is minimum among all tuples in . 

We observe that a classical Constraint Network is merely a CFN where all cost func¬ 
tions Ws G >V are such that '^£ G L‘®, Ws(^) G {0, T}. The problem of the existence 
of a solution in a constraint network, called Constraint Satisfaction Problem (CSP), is 
NP-complete. Finding an optimal solution to a CFN is thus above NP. Restrictions to 
Boolean variables and binary constraints are known to be APX-hard [H^. In the termi¬ 
nology of stochastic graphical models, this problem is also equivalent to the Maximum 
A Posteriori (MAP/MRF) problem or the Maximum Probability Explanation (MPE) in 
Bayesian networks CFNs can be solved exactly with depth-first branch-and-bound 
search using W 0 as a lower bound. Search efficiency is enhanced by maintaining local 
consistencies that increase the lower bound by redistributing costs among Ws, push¬ 
ing costs into W 0 and Wi, and pruning values while preserving the equivalence of the 
problem {i.e., the cost of each tuple £ G is unchanged). 


2.2. Soft local consistencies and EPTs 

Different consistency notions have been defined. Examples include NC* 144 
suslik, FD(G)AC* (weak) ED(G)AC* 
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, (G)AC* [6 
VAC and 


OS AC [ 2 ^. Enforcing such local consistencies requires applying equivalence preserving 
transformations (EPTs) that shift costs between different scopes. The main EPT is de¬ 
fined below and described as Algorithm [TJ This is a compact version of the projection 
and extension defined in [25|. 


Definition 4 (EPTs [^1. Given two cost functions Ws^ and Ws^, S 2 C Si, the EPT 
Project {Si, S 2 , £, a) shifts an amount of cost a between a tuple £ G of Ws 2 and the 
cost function Ws^- The direction of the shift is given by the sign of a. The precondition 
guarantees that costs remain non negative after the EPT has been applied. 

Denoting by r = \S 2 \, the EPT is called an r-EPT. It is an r-projection when a > 0 
and an r-extension when a < 0. 


It is now possible to introduce local consistency enforcing algorithms. 

Definition 5 (Node Consistency [i^). A variable Xi is star node consistent (NC*) if 
each value v G D{xi) satisfies Wi{v) © W0 < T and there exists a value v' G D{xi) such 
that Wi{v') = Q. A CFN is NC* iff all variables are NC*. 
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Precondition: -Ws^ii) <a< min^,^j^si VPsj (f); 

Procedure Project (S'!, S' 2 ,a) 

Ws,{£)^Ws,{£)(Ba-, 
foreach (£' £ such that ^'[ 82 ] = (■) do 
IWsAn^Wsdnea-, 

Algorithm 1: A cost shifting EPT used to enforce soft local consistencies. The ©,0 
operations are extended here to handle possibly negative costs as follows: for non-negative 
costs a, P, we have a 0 (—/3) = a © /3 and for /3 < a, a © (—/3) = a © /9. 


Procedure enforceNC*() 

1 foreach xi £ X do unaryProject(a:i); 

2 foreach xi £ X do pruneVar(a:i); 

Procedure unaryProject(a;i) 

3 a := min{Wi(u) | v £ D{xi)}-, 

4 Project({xi}, 0, 0, a) ; 
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Procedure pruneVar(a::i) 

foreach v £ D{xi) s.t. Wi{v) © W 0 = T do 
|_ D{xi) := D{xi) \ {t}; 


Algorithm 2: Enforce NC* 


Procedure enforceNC*() in Algorithm[2]enforces NC*, where unaryProject() applies 
EPTs that move unary costs towards W 0 while keeping the solution unchanged, and 
pruneVar(xi) removes infeasible values. 


Definition 6 ((Generalized) Arc Consistency 23, 3 E^)- Given a CFNP = (A, W, T), 
a cost function Ws G W'*" and a variable Xi £ S. 


• A tuple i £ is a simple support for v £ Dixi) with respect to Ws with Xi £ S 
iff£[xi] =v andWs{e) = 0. 

• A variable Xi £ S is star generalized arc consistent (GAC*) with respect to Ws iff 

— Xi is NC*; 

— each value vt £ D(xi) has a simple support I with respect to Ws- 


• A CFN is GAC* iff all variables are GAC* with respect to all related non-unary 
cost functions. 


To avoid exponential space complexity, the GAC* definition and the algorithm is 
slightly different from the one given by Cooper and Schiex (23j |. which also requires for 
every tuple £ £ L'®, Wsii) = T if IE 0 © Wi{£[xi]) © Wsi£) = T. 

The procedure enforceGAC*() in Algorithm [3j enforces GAC* on a single variable 
Xi £ X with respect to a cost function Ws £ W^, where Xi G S' in a CFN (A,W, T). 
The procedure first computes the minimum when Xi = v for each v £ D{xi) at line|31 
then performs a 1-projection from Ws to Wi at line|31 Lines [3] and |3] enforce NC* on Xi- 
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Procedure enforceGAC*(VPs, Xi) 
foreach v € D{xi) do 

a := Tah\{Ws{t) | f G L'^ A i[xi\ 
Project (S', {xi}, (v), a) 
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unaryProject(a;i); 

pruneVar(a:i); 


v}; 


Algorithm 3: Enforcing GAC* for Xi with respect to Ws 


Local consistency enforcement involves two types of operations: (1) finding the min¬ 
imum cost returned by the cost functions among all (or part of the) tuples; (2) applying 
EPTs that shift costs to and from smaller-arity cost functions. 

Minimum cost computation corresponds to line [5] in Algorithm [5J and line [3] in Algo¬ 
rithmic] For simplicity, we write min{Ws{i) \ i G L"®} as min{Ws}- 

In practice, projections and extensions can be performed in constant time using the 
A data-structure introduced in Cooper and Schiex [^. For example, when we perform 
1-projections or 1-extensions, instead of modifying the costs of all tuples, we store the 
projected and extended costs in A“. ^ and A+ ^ respectively. Whenever we compute the 
value of the cost function Ws for a tuple £ with l[xi] = v, we return Ws(£)©A“ j,©A+ 
The time complexity of enforcing one of the previous consistencies is thus entirely defined 
by the time complexity of computing the minimum of a cost function during the enforcing. 

Proposition 1. The procedure enforceGAC*{) in Algorithm\^ requires 0{d- fmin) time, 
where d is the maximum domain size and fmin is the time complexity of minimizing Ws ■ 

Proof. Line [3| requires 0(/mm) time. We can replace the domain of Xi by {u}, and 
run the minimum computation to get the minimum cost. Projection at line |3| can be 
performed in constant time. Thus, each iteration requires 0{fmin)- Since the procedure 
iterates d times, and the procedures unaryProject and pruneVar requires 0{d), the 
overall complexity is 0{d ■ fmin + d) = 0{d • fmin)- □ 

In the general case, fmin is in 0{d'') where r is the size of the scope and d the maximum 
domain size. However, a global cost function may have specialized algorithms which make 
the operation of finding minimum, and thus consistency enforcement, tractable. 


2.3. Global Constraints, Soft Global Constraints and Global Cost Functions 

Definition 7 (Global Gonstraint A global constraint, denoted by GC(S', Ai,... ,At), 

is a family of hard constraints parameterized by a scope S, and possibly extra parameters 

Ai,..., At. 


Examples of global constraints are AllDifferent [^, GCC [59j, Same [13|, Among 
Regular [^, Grammar [s^, and Maximum/Minimum constraints Q. Because of 
their unbounded scope, global constraints cannot be efficiently propagated by generic lo¬ 
cal consistency algorithms, which are exponential in the arity of the constraint. Specific 
propagation algorithms are designed to achieve polynomial time complexity in the size 
of the input, i.e. the scope, the domains and extra parameters. 
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To capture the idea of costs assigned to constraint violations, the notion of soft global 
constraint has been introduced. This is a traditional global constraint with one extra 
variable representing the cost of the assignment w.r.t. to an existing global constraint. 
The cost is given by a violation measure function. 


Definition 8 (Soft Global Constraint [57|). A soft global constraint, denoted by Soft_GC^(S'U 
{z'\, Ai,... ^ At), is a family of hard constraints parameterized by a violation measure fi, 
a scope S, a cost variable z, and possibly extra parameters Ai,..., At- The constraint is 
satisfied if and only if z = gL{S, Ai^..., At). 


Soft global constraints are used to introduce costs in the CSP framework, and there¬ 
fore inside constraint programming solvers 5^ . It requires the introduction of extra cost 
variables and does not exploit the stronger propagation offered by some of the soft local 
consistencies. A possible alternative, when a sum of costs needs to be optimized, lies in 
the use of global cost functions. 


Definition 9 (Global Cost Function 0, 0). A global cost function, denoted as 
W_GCF(S', Ai,..., At), is a family of cost functions parameterized by a scope S and 
possibly extra parameters Ai,..., At. 


For example, if 5" is a set of variables with non-negative integer domains, it is easy to 
define the Global Cost Function W_Sum( 5') = min(T,xt). 

It is possible to derive a global cost function from an existing soft global constraint 
Soft_GC''(5' U {z}, Ai, ..., At). In this case, we denote the corresponding global cost 
function as W_GCF^. Its value for a tuple ^ G L"® is equal to min(T,^(£)). 

For example, global cost functions W_AllDifferent’'“’'/W_AllDifferent'^®° [H, 
[s^ can be derived from two different violation measures of AllDifferent, namely 
variable-based and decomposition-based [^, l67| , respecti vely . Other examples include 


W_GCC’'“’' and W_GCC"“' m 


w_same"“’’ m 


W_Regular’'“'' and W-Regular®'^** W_EGCC’'“’' 

and W_Cumulative’'“* [5l.[5ll. 


15011 . W_SlidingSum^°’' |52 
, W_Disjunctive’'“* 


3. Tractable Projection-Safety 

All soft local consistencies are based on the use of EPTs, shifting costs between two 
scopes. The size of the smallest scope used in a EPT is called the order (r) of the EPT. 
Such a EPT is called an r-EPT. It is directly related to the level of local consistency 
enforced: node consistency uses EPTs onto the empty scope (r = 0), arc consistencies use 
unary scopes (r = 1) whereas higher-order consistencies use larger scopes (r > 2) [^ . 
In this section, we show that the order of the EPTs directly impacts the tractability of 
global cost function minimization. 

To be able to analyze complexities in global cost functions, we first define the decision 
problem associated with the optimization problem min{VF_GC'F(S', Ai,..., At)}. 

IsBetterThan(W_GCF(5', Ai, ..., At), m) 

Instance. A global cost function W_GCF, a scope S with domains for the variables 

in S, values for the parameters Ai,..., At, and a fixed integer m. 

Question. Does there exist a tuple £ G such that W_GCF(S', Ai,..., At)(£)<m? 
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We can then define the tractability of a global cost function. 

Definition 10. A global cost function W_GCF(S', Ai,..., At) is said to be tractable iff 
the problem IsBETTERTHANfW_GCF(S', yli,... ,At),m) is in P. 

For a tractable global cost function Ws =W_GGF(S', Ai,..., At), the time complexity of 
computing min{lFs} is bounded above by a polynomial function in the size of the input, 
including the scope, the corresponding domains, the other parameters of the global cost 
function, and log(TO). 

We introduce tractable r-projection-safety global cost functions, which remain tractable 
after applying r-EPTs. 

Definition 11. We say that a global cost function W-GCF{S, Ai,..., At) is tractable 
r-projection-safe iff: 

• it is tractable and; 

• any global cost functions that can be derived from W-GCF{S, Ai ,..., At) by a series 
of r-EPTs is also tractable. 

The tractability after r-EPTs depends on r. We divide the discussion of tractable 
r-projection-safety into three cases: r = 0, r > 2 and r = 1. In the following, given 
a tractable global cost function Ws, we denote by Vr(lFs) the global cost function 
resulting from the application of an arbitrary finite sequence of r-EPTs on Ws- 


3.1. Tractability and 0-EPTs 

When r = 0, EPTs are performed to/from W 0 . This kind of EPTs is used when 
enforcing Node Consisten cy ( NC*) but also in 0-inverse consistency [^, and strong 
0-inverse consistency 


We show that if a global cost function is tractable, it remains tractable after applying 
such EPTs. 


Theorem 1. Every tractable global eost function is tractable 0-projection-safe. 

Proof. Consider a tractable global cost function Ws = WJGCF{S, Ai,..., At). Clearly, 
Ws and Vo (IPs) only differ by a constant, i.e. there exists a~ and , where a“, € 

{0,..., T}, such that: 

Vo(IPs) W = Ws{t) © a+ © a”, for all £ e 

If Ws{t} = minjiPs} for some £ G L'®, then Vo(IPs)(£) = min{Vo(lPs)}. If Ws is 
tractable, so is Vo(IPs). □ 


3.2. Tractability and EPTs of order greater than 2 

When r > 2, EPTs are performed to/from r-arity cost functions. This is required 
for enforcing higher order consistencies and is used in practice in ternary cost functions 
processing j6l[ and complete fc-consistency (^ . 

If arbitrary sequences of r-EPTs are allowed, we show that tractable global cost 
functions always become intractable after some sequence of r-EPT applications, where 
r > 2. 
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Theorem 2. Any tractable global cost function W_GCF(5', Ai,..., At) returning finite 
costs is not tractable r-projection-safe for r > 2, unless P = NP. 

Proof. Let us first define the binary constraint satisfaction problem ArityTwoCSP as 
follows. 

ArityTwoCSP (T, W'*) 

Instance. A CSP instance where every constraint Cg G involves two 

variables, i.e. [S'! = 2. 

Question. Is the CSP (A, W^) satisfiable? 

ArityTwoCSP is NP-hard as graph coloring can be solved through a direct modeling 
into ArityTwoCSP. We reduce the problem ArityTwoCSP(A, W^) to the problem 
IsBetterThan(V 2 (ILA), T), where Wx = W_GC(A, Ai, ..., At) is an arbitrary global 
cost function using only finite costs. We first construct a CFN (A, W U {Wx}, T). The 
upper bound T is a sufficiently large integer such that T > Wx{f) for every £ G L‘®, 
which is always possible given that Wx remains finite. This technical restriction is not 
significant: if a global cost function Ws maps some tuples to infinity, we can transform 
it to another cost function Wg such that the infinity costs are replaced by a sufficiently 
large integer p such that p » max{Ws{i) | ^ G L'^ A Ws(i) +oo}. 

The cost functions Ws G W \ {Wx} are defined as follows: 

TT 7 //)\ f 0) if ^ is accepted by Cq G 

= I T, otherwise ^ * 

From the CFN, V 2 can be defined as follows: for each forbidden tuple £[5] in each 
Cg G W^, we add an extension of T from Ws to Wx with respect to i[S] into V 2 . Under 
this construction, V 2 (IUy)(^) can be represented as: 

V2{Wx){e) = Wx{t)® 0 VFs(€[5]) 

Wsew 

For a tuple I G L'^, '^ 2 (Wx)(^) = T iff £ is forbidden by some Cg in W^. As a result 
IsBetterThan(V 2 (VUt), T) is satisfiable iff ArityTwoCSP (A, W^) is satishable. As 
ArityTwoCSP is NP-hard, IsBetterThan(V 2 (VF_GC'), T) is not polynomial, unless 
P = NP. Hence, V 2 (W-GC) is not tractable, and then, W-GC is not tractable 2- 
projection-safe, unless P = NP. □ 


3.3. Tractability and 1-EPTs 

When r = 1, 1-EPTs cover 1-projections and 1-extensions, which are the backbone 
of the consistency algmithms of (G)AC * FD(G)AC* [H, 0,153, (weak) 

ED(G)AC* [13 23 Ho], VAC, and OSAC In these cases, tractable cost functions 


are tractable 1-projection-safe only under special conditions. For example, Lee and Leung 
define flow-based projection-safety based on a flow-based global cost function. 

Definition 12 (Flow-based Hi, [13). A global cost function W-GCF{S, Ai, ..., At) is 
flow-based iff it can be represented as a flow network G such that the minimum cost 
among all maximum flows between a fixed source and a fixed destination is equal to 
uim{W.GCF{S, Ai,..., At)}. 
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Definition 13 (Flow-based projection safe [H, 11^). A global cost function 
W-GCF{S, Ai,..., At) is flow-based projection-safe iff it is is flow-based, and is still 
flow-based following any sequence of 1-projections and 1-extensions. 


Lee and Leung further propose sufficient conditions for tractable cost func¬ 

tions to be flow-based projection-safe. Flow-based projection-safety implies tractable 
1-projection-safety. We state the result in the following theorem. 


Theorem 3. Any flow-based projection-safe global cost function is tractable 1-projection- 
safe. 


Proof. Follows directly from the tractability of the minimum cost flow algorithm. □ 


However, tractable cost functions are not necessarily tractable 1-projection-safe. One 
example is W_2SAT, which is a global cost function derived from an instance of the 
polynomial 2SAT problem. 

Definition 14. Given a set of Boolean variables S, a set of binary clauses F, and a 
positive integer c, the global cost function W_2SAT(S', F, c) is defined as: 


W.2SAT{S,F,c){i) 


0, if £ satisfies F 
c, otherwise 


W_2SAT is tractable, because the 2SAT problem is tractable [^. However, it is not 
tractable 1-projection-safe. 


Theorem 4. W_2SAT is not tractable 1-projection-safe, unless P = NP. 


Proof. Let us first define the WSAT-2-CNF problem. 


WSAT-2-CNF 

Instance. A 2-CNF formula F (a set of binary clauses) and a fixed integer k. 
Question. Is there an assignment that satisfies all clauses in F with at most k 
variables set to true ? 


WSAT-2-CNF was shown NP-hard in page 69]. We reduce it to the problem 
ISBETTERTHAN(Vi(IF_2S'Ar), T). 

We construct a particular sequence of 1-projections and/or 1-extensions Vi such that 
the WSAT-2-CNF instance can be solved using Wx = WJ2SAT{X,F,k -\- 1) from the 
Boolean CFN N = {X,W U {Wx}, fc + 1). W only contains unary cost functions Wi, 
which are dehned as follows: 


Wflv) 


1 , if n = true' 
0 , otherwise 


Based on N, we construct Vi as follows: for each variable Xi € X, we add an extension 
of 1 from Wi to Wx with respect to the value true into Vi. As a result, a tuple £ with 
Vi{Wx){£) = k' < k contains exactly k' variables set to true (because every Xi = true 
incurs a cost of 1) and also satisfies F (or it would have cost k-\-l = T). Thus, the WSAT- 
2-CNF instance with threshold k is satisfiable iff IsBetterThan(Vi(HA’), fc -|- 1) is 
satisfiable. As WSAT-2-CNF is NP-hard, IsBetterThan(Vi(W_ 2S'AT), fc-h 1) is not 
polynomial, unless P = NP. Hence, VifWJISAT) is not tractable, and then, WJ2SAT 
is not tractable 1-projection-safe, unless P = NP. □ 

10 




When the context is clear, we use tractable projection-safety, projection and exten¬ 
sion to refer to tractable 1-projection-safety, 1-projection and 1-extension respectively 
hereafter. 


4. Polynomial DAG-Filtering 


Beyond flow-based global cost functions we introduce now an additional class 

of tractable projection-safe cost functions based on dynamic programming algorithms. 
As mentioned by Dasgupta et al. (^ . every dynamic programming algorithm has an 
underlying DAG structure. 


Definition 15 (DAG). A directed acylic graph (DAG) T = {V,E), where V is a set of 
vertices (or nodes) and E C V x V is a set of directed edges, is a directed graph with no 
directed cycles, and: 


• An edge {u, v) G E points from u to v, where u is the parent of v, and v is the 
child of u; 

• A root of a DAG is a vertex with zero in-degree; 

• A leaf of a DAG is a vertex with zero out-degree; 

• An internal vertex of a DAG is any vertex which is not a leaf; 


We now introduce the DA G filterahility of a global cost function. 

Definition 16 (DAG-filter). A DAG-filter for a cost function Ws is a DAG T = {V,E) 
such that: 


• T is connected; 

• V = is a set of cost function vertices each with a scope Si, among which 

vertex Ws is the root of T; 

• Each internal vertex ws. in V is associated with an aggregation function ft that 
maps a multiset of costs {aj \ aj G [0 ... T]} to [0 ... T] and is based on an asso¬ 
ciative and commutative binary operator; 

• For every internal ws^ G V , 

— the scope of cosi is composed from its children’s scopes: 

S.= U S, 


— LOSi is the aggregation of its children: 

= Mi^sMSA) I (^s.,a;s,) G E}); 

— min is distributive over f^: 

min{wsj = /j({min{wsJ | (wsi,ws,.) G E}). 
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When a cost function Ws has a DAG-filter T, we say that Ws is DAG-filterable by 
T. Note that any cost function Ws has a trivial DAG filter which is composed of a single 
vertex that defines Ws as a cost table (with size exponential in the arity |S'|). 

In the general case, a DAG-filter (recursively) transforms a cost function into cost 
functions with smaller scopes until it reaches the ones at the leaves of a DAG, which may 
be trivial to solve. The (minimum) costs can then be aggregated using the fi functions at 
each internal vertex to get the resultant (minimum) cost, through dynamic programming. 
However, further properties on DAG-filters are required to allow for projections and 
extensions to operate on the DAG structure. 

Definition 17 (Safe DAG-filter). A DAG-filter T = {V,E) for a cost function Ws is 
safe ijf: 

• projection and extension are distributive over fi, i.e. for a variable x G S, a cost 
a and a tuple £ G , 

- uJSiii[Si]) © i^x.Siia) = /i({wSfc(^[S'/c]) © J^x.Sfc(a) I i^Si,^Sk) ^ E}), and; 

- uJs,{£[Si]) e i^x,Si{a) = fi{{i^SkWSk]) e i^x,Sk{oi) I G A}), 

where the function v is defined as: 

. . fa, if X G Sj, 

^x,Sj\cf) otherwise. 

The requirement of a distributive fi with respect to projection and extension at 
each vertex in T implies that the structure of the DAG is unchanged after projections 
and extensions. Both operations can be distributed down to the leaves. We formally 
state this as the following theorem. Given a variable x, with a value a G D{x), and 
a cost function Ws, we denote as Wg the cost function obtained by the application of 
Project (S', {x}, (t’jja) on Ws if x S S or Ws otherwise. 

Theorem 5. For a cost function Ws with a safe DAG-filter T = (V,E), Wg has a safe 
DAG-filter T' = (V',E'), where each ujg^ G V is defined as: 

' = / ©^'a:.Sfc(a), ifoJSi *5 « W ofT, 

f LOSi, otherwise. 

and {ujg.,ujg^) G E' iff (ujSi,uJSk) ^ T' is isomorphic to T. Moreover, both 

bj'g. G V and G V are associated with the same aggregation function fi. 

Proof. Follows directly from Definition [TT] □ 

Two common choices for fi are © and min, with which distributivity depends on 
how scopes intersect. In the following, we show that the global cost function is safely 
DAG-filterable if the internal vertices that are associated with © have children with non¬ 
overlapping scopes, and those associated with min have children with identical scopes. 

Proposition 2. Any DAG-filter T = {V,E) for a cost function Ws such that 

• each LuSi G V is associated with the aggregation function fi = ©; 
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• for any distinct uJSj,uJSk G V, which are children of tusi, Sj D Sk = 0 ■ 
is safe. 

Proof. We need to show that min, projection and extension are distributive over ©. Since 
the scopes of the cost functions do not overlap, min is distributive over ©. We further 
show the distributivity with respect to projection (©), while extension (©) is similar. 
We consider an internal vertex G V. Given a variable x € Si, a cost a, and a tuple 
f G L'®, since the scopes of the cost functions {wgj, | (ws^,wsj,) G E} are disjoint, there 
must exist exactly one cost function ujSj such that x € Sj, i.e.: 

ujSi{i)ea = © a) © ^ ujs^{i[Sk]) 


The result follows. □ 

Proposition 3. Any DAG-filter T = {V,E) for a cost function Ws such that 

• each LuSi € V is associated with the aggregation function fi = min; 

• VwSj G V, which are children of ujsa Sj = Si. 

is safe. 

Proof. Since the scopes are completely overlapping, 
minjws^} = 

It is trivial to see that projection and extension are distributive over fi. The result 
follows. □ 

We are now ready to define polynomial DAG-filterability of global cost functions. As 
safe DAG-filters can be exponential in size, we need to restrict to safe DAG-filters of 
polynomial size by restricting the size of the DAG to be polynomial and by bounding 
the arity of the cost functions at the leaves of the DAG. 

Definition 18 (Polynomial DAG-filterability). A global cost function W-GCE{S, Ai,... ,At 
is polynomially DAG-filterable iff 

1. any instance Ws of WJGCF{S, Ai,..., At) has a safe DAG-filter T = {V, E) 

2. where \ V\is polynomial in the size of the input parameters ofWJGCF{S, Ai,..., At); 

3. each leaf in V is a unary cost function, and 

4 . each aggregation function fi associated with each internal vertex is polynomial-time 
computable. 


min { min {a;s,(^)}} 
min { min 

/i({min{a;sj | G E}) 
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Dynamic programming can compute the minimum of a polynomially DAG-filterable 
cost function in a tractable way. Projections and extensions to/from such cost functions 
can also be distributed to the leaves in T. Thus, polynomially DAG-filterable global cost 
functions are tractable and also tractable projection-safe, as stated below. 

Theorem 6. A polynomially DAG-filterable global cost function WJGCF{S, Ai,..., At) 
is tractable. 

Proof. Let Ws be any instance of W-GCF{S, Ai,..., At), and T = {V,E) be a safe 
DAG-filter for Ws. Algorithm 0] can be applied to compute min{VTs}. The algorithm 
uses a bottom-up memoization approach. Algorithm 0] first sorts V topologically at line 
01 After sorting, all the leaves will be grouped at the end of the sorted sequence, which 
is then processed in the reversed order at line01 If the vertex is a leaf, the minimum is 
computed and stored in the table Min at line 01 Otherwise, its minimum is computed 
by aggregating {Min[w5^] | (w5^,W5^) e E}, which have been already computed, by the 
function ft at line 01 Line 01 returns the minimum of the root node. 

The computation is tractable. Leaves being unary cost functions, line 01 is in 0{d), 
where d is the maximum domain size. For other vertices, line 01 calls ft, which is assumed 
to be polynomial time. The result follows. □ 


1 

2 

3 

4 

5 

6 


Function Minimum (IPs) 

Form the corresponding filtering DAG T = {V,E)\ 
Topologically sort V\ 

foreach € F in reverse topological order do 
if uJSi is a leaf of T then 
I MinItJsJ ~ min{ijJSi} ; 
else 

|_ Min[(^5j /i({Min[a;Sfc] | {uds^^^Sk) ^ E}) ; 
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return Min[IFs]; 


Algorithm 4: Gomputing minjWs} 

Note that Algorithm 01 computes the minimum from scratch each time it is called. 
In practice, querying the minimum of cost function Ws when Xi is assigned to v for 
different values v can be done more efficiently with some pre-processing. We define 
Min'^[uiSj,Xi,v\ that stores min{a;Sj(£) | Xi € Sj /\ (.[xi] = u}. Min'^[uiSj,Xi,v\ can be 
computed similarly to Algorithm 01 by using the equation: 

{ ojsfiv), if LOSj is a leaf of T and Sj = {xi} 

minjwsj}, if ojSj is a leaf of T and Sj {si} 

fj{{Min+[ujSt,,x^,v]) I (ujSt.ojSk) e E}), otherwise 

Whenever we have to compute the minimum for Xi = v, we simply return Min'^[Ws, Xi, u]. 
Gomputing Min~^[Ws,Xi,v] is equivalent to running Algorithm 01 nd times, where n is 
the number of variables and d the maximum domain size. However, this can be reduced 
by incremental computations exploiting the global constraint semantics, as illustrated on 
the W_Grammar"“’' global cost function in Section 01 
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We now show that a polynomially DAG-filterable cost function is tractable projection- 
safe. The following lemma will be useful. For a variable x G S and a value v G 
D{x), we denote as W'-GCF{S, Ai,..., At) the cost function obtained by applying 
Project (5, {cc}, {v),a) to a global cost function W-GCF{S, Ai,... ,At). 

Lemma 1. If a global cost function W-GGF(S, Ai,..., At) is polynomially DAG-filterable, 
W JGGF{S, Ai,..., At) is polynomially DAG-filterable. 

Proof. Suppose W-GGF{S, Ai,..., At) is polynomially DAG-filterable. Then any in¬ 
stance Ws of it has a safe filtering DAG T = {V,E). By Theorem [SI we know that Wg, 
the corresponding instance of W'-GGF{S, Ai,..., At), has a safe DAG filter T', which 
is isomorphic to T, has polynomial size, and polynomial-time computable ft associated 
with each internal vertex. The leaves of T' only differ from those of T by a constant. 
The result follows. □ 

Theorem 7. A polynomially DAG-filterable global cost function WJGGF{S, Ai,..., At) 
is tractable projection-safe. 

Proof. Follows directly from Theorem [6] and Lemma [1] □ 

As shown by Theorem 0 a polynomially DAG-filterable cost function Ws remains 
polynomially DAG-filterable after projection or extension. Algorithm [5] shows how the 
projection is performed from Ws and Wi, where Xi G S. Lines [5] to |S] modify the leaves 
of the filtering DAG, as suggested by Theorem |5l 

Lines|n|to|S]in Algorithm |S] show how incrementality can be achieved. If Wi or D{xi), 
Xi G S, are changed we update the entry Min[uiSi] line [SI which corresponds to the 
leaf uJSi, where Xi G Si. The change propagates upwards in lines |S] and |5l updating all 
entries related to the leaf wg.. The table FW can be updated similarly. 


1 

2 

3 
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Precondition: Ws is polynomially DAG-filterable with the filtering DAG 
T={V,E)- 

Procedure Project {S, {xi}, (v), a) 

Wi(v) := Wi{v) 0 a ; 

foreach G V such that Sj = {0:^} and cosj is a leaf of T do 
uJSj{v) := ujs,i{v) 0 a ; 

|_ Min[a;sJ := min{uJSj} ; 


5 

6 

7 
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Topologically sort V ; 

foreach S F in reverse topological order do 
if ojSj is not a leaf and Xi G Sj then 
L Min[a;sJ := /*({Min[wsJ | (a;s,.,a;sj G E}) ; 


Algorithm 5: Projection from a polynomially DAG-filterable global cost function 

The time complexity of enforcing GAG* on a polynomially DAG-filterable global cost 
function heavily depends on preprocessing, as stated in the following corollary. 

Corollary 1. If the time complexity for pre-computing the table Min'^ for a polynomially 
DAG-filterable cost function Ws is 0{K{n,d)), where K is a function of n = and 
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maximum domain size d, then enforcing GAC* on a variable Xi € S with respect to Ws 
requires 0{K(n, d) + d) time. 

Proof. Computing the minimum of Ws when Xi = v, where Xi G S and v G D{xi), 
requires only constant time by looking up from Min'^. By Proposition [TJ the time 
complexity is 0{K{n, d) + d) time. □ 

We have presented a new class of tractable projection-safe global cost functions. 
Algorithm 3] gives an efficient algorithm to compute the minimum cost. In the next 
Section, we give an example of such a global cost function. More examples can be found 
in the associated technical report 0]. 

5. A Polynomially DAG-filterable Global Cost Function 

In the following, we show that W_Grammar’'“’', W_Among"“'’,W_Regular’'“’’, 
W_Max, and W_MiN are polynomially DAG-filterable using the results from the previ¬ 
ous section. 

5.1. The W_Grammar’'“’’ Cost Function 

W_Grammar’'“’' is the cost function variant of the softened version of the hard global 
constraint Grammar defined based on a context-free language. 

Definition 19. A context-free language L(G) is represented by a context-free grammar 
G= (S,A,P,Ao), where: 

• T, is a set of terminals; 

• N is a set of non-terminals; 

• P is a set of production rules from N to (EU A)*, where * is the Kleene star, and; 

• Aq G N is a starting symbol. 

A string r belongs to L{G), written as t G L{G) iff t can be derived from G. 

Without loss of generality, we assume that (1) the context-free language L{G) does not 
contain cycles, and (2) the strings are always of fixed length, representing values in tuples. 

Assume S = {xi,... ,Xn}. We define to be a string formed by a tuple i G L‘®, 
where the character of Tf is £[xij. The hard constraint GRAMMAr(S', G) authorizes a 
tuple £ G if T( G L{G) [s^. Using the violation measure var by Katsirelos et al. [40| . 
the W_Grammar"“'’ cost function is defined as follows. 

Definition 20 (W_Grammar"“’' 0). Given a context-free grammar G = 
{Tt, N, P, Aq). W_Grammar'"“’'(S', G) returns min{iJ(T^, r^) | Ti G L{G)} for each tuple 
£ G , where H{ti,T 2 ) returns the Flamming distance between ti and T 2 . 

Example 1. Consider S = {xi,X 2 ,XQ,X 4 f\, where D{xi) = {a, b, c} for z = 1... 4. Given 
the grammar G = ({a, b, c}, {Aq, A, B, G}, P, S) with the following production rules. 

Aq —y AA 
A -G a\AA\BG 
B -G h\BB 

G -G c I GG 
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The cost returned hy W_Grammar’'“’’( 5', G')(f) is 1 if £ = {c,a,b,c). The assignment of 
xi needs to he changed so that L{M) accepts the corresponding string aabc. 

Theorem 8 . W_Grammar^“’'(5', G) is a polynomially DAG-filterable and thus tractable 
projection-safe global cost function. 

Proof. We adopt the dynamic programming approach similar to the modified CYK 
parser (d^ . Without loss of generality, we assume G is in Chomsky normal form, i.e. each 
production rule always has the form A —>• a or A —>• BC, where A G N, B,C € N\ {Aq} 
and a G S. 

Define ujg. . = W_GRAMMAR"“’'(5'i_j, Ga), where i < j, Sij = {xi.. .xj} C S, and 
Ga = (Y, N, P, A) for A G N. By definition, 

W_Grammar™’’(5', G)(£) = 


The base cases Wg. . is defined as follows. Define Tia = {a | A —)• a} to be the set of 
terminals that can be yielded from A. 


^Si i (^) — 


min{17“(f[a;i]) I (A-)• a) e P}, if 0 

T, otherwise 


The unary cost function U°‘{£[xi\) is defined as follows. 

ur{v) = 


if r = a; 
otherwise 


( 1 ) 

( 2 ) 


Other cost functions Wg. where i < j, are defined as follows. Let Na = {{B, C) \ A ^ 
BC} be the set of pairs of non-terminals that are yielded from A. 




min I (A ^ BC) G P}, 

K—— L 

T, 


if ^ 0 
otherwise 

(3) 

□ 


The associated filtering DAG {V, E) is illustrated in Figure [T] on Example [TJ In 
Figure [11 leaves are indicated by double circles, corresponding to the unary cost function 
in equation [2] Vertices with min or © aggregators are indicated by rectangles and circles 
respectively, corresponding to cost functions in equation [3] if i ^ j, or equation [T] 
otherwise. As shown in Figure [1] the root node W_Grammar is first split by the 
production rule Aq AA. One of its children u}g_^ ^ leads to the leaf Uf according to 
the production rule A ^ o. The DAG uses only © or min as aggregations and they 
satisfy the preconditions that allow to apply propositions [5] and [3] The cost function 
is therefore safely DAG-filterable. Moreover, the corresponding DAG (V, E) has size 
|V| = 0(|P| • |S'p) polynomial in the size of the input. The leaves are unary functions 
{U°‘} and by Theorem [71 the result follows. 

Note that Theorem |S] also gives a proof that W_Regular’'“'’ is tractable projection- 
safe. Indeed, a finite state automaton, defining a regular language, can be transformed 
into a grammar with the number of non-terminals and production rules polynomial in 
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Figure 1: The DAG corresponding to W_Grammar'"“’^ 


the number of states in the automaton. Then, W_Among'"“’' is also tractable projection- 
safe since the tuples satisfying an Among global constraint can be represented using a 
compact finite state counting automaton 

Function GrammarMin in Algorithm [5] computes the minimum of W_Grammar'"“’"(S', G). 
We first compute the minimum of the unary cost functions in the table u[i,c\ at lines [6] 
to [SI The table / of size n x n x |A^| is filled up in two separate for-loops: one at linejS] 
according to the equation (TJ and another one at line [S] for the equation [31 The result is 
returned at linejS! 

Theorem 9. The function GrammarMin in Algorithm computes the minimum of the 
global cost function W-GrammA'r'"°‘^{S,G = {Tt, N, P, Aq)) in time 0(n(i ■ |I]|-|-• |P|), 
where n = [S'! and d is the maximum domain size. 

Proof. Lines [S] to [S] take 0(nd ■ |E|). The first for-loop at lines [S] to [S] requires 0{n ■ |P|), 
while the second one at lines initoin] requires 0{n^ ■ IT’D. The overall time complexity is 
0(nd • |E| -b n ■ |P| -b n3 ■ |P|) = 0(nd ■ |S| -b • |P|). □ 

As for incrementality, Algorithm [7] gives the pre-processing performed on top of Al¬ 
gorithm |6j based on the weighted CYK propagator used in Katsirelos et al. [^. We 
compute the table / at line [7] using Algorithm |6l Then we compute the table F at lines [7] 
tojTjusing the top-down approach. For each production A A 1 A 2 , lines |7| and |7| compute 
the maximum possible costs from their neighbors. An additional table marked[i, j, A] is 
used to record whether the symbol A is accessible when deriving sub-strings at positions 
i to j in G. Each time we need to compute the minimum for Xi = v, we just return 
m.m{U°‘{v) © F[i, i, A] © f[0, n — 1, Aq] \ {A i-a v) G P A marked[i, i, A]}, or T if such 
production does not exist. 

Corollary 2. Given Ws = W_Grammar"“’'(5', G = (F,, N, P, Aq)). Enforcing GAG* 
on a variable Xi G S with respect to W_Grammar requires 0{nd ■ |E| -b • |P|) time, 
where n = [S'! and d is the maximum domain size. 
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Function GraininarMin(5’, G) 
for z := 1 to n do 
for c ^ E do 

|_ u[i,c] := miii{t/f}; 


4 

5 

6 
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for z := 1 to n do 

foreach A € N do /[z, z, A] := T] 
foreach (A, a) such that {A\-^ a) G P do 
1_ /[*,*,^] = niin{/[i,*, A],'u[z,a]} ; 


8 return GramunarPartialMin(S', G, 1); 
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Function GrammarPartialMin(S', G, start) 
for len := 2 to n do 

for i := start to n — len + 1 do 
j := i + len — 1 ; 

foreach A € N do f[i,j, A] := T; 
foreach (A, ^1,^2) such that (A 1-^ A1A2) € P do 
for /c := 1 to j — 1 do 

L /[*G',^] := min{/[i,j, A],/[i,fc, Ai] ® f[k + I, j, A2]} ; 
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return f[l,n,Ao]; 


Algorithm 6: Finding the minimum of W_Grammar’'“’’ 


Proof. Using a similar argument to that in the proof of Theorem [21 Algorithm [7] requires 
0{nd- |S| + n^- |P|) time. The result follows directly from Corollary [1] and Theorem|21 □ 

Algorithm [5] shows how projection is performed between W-Grammar^'^^ and Wp, 
and how incrementally can be achieved. Line [8] modifies the leaves Up for each c G E, 
while lines [5] and |S] update the corresponding entries in the tables u and / respectively. 
The change is propagated up in / at line [H corresponding to derivation of sub-strings 
with positions from p to the end in G. 

5.2. The W_Among’'“’’ Cost Function 

W_Among’'“’' is the cost function variant of the softened version of Among using 
the corresponding variable-based violation measure . 

Definition 21. f^I Given a set of values V, a lower bound lb and an upper bound ub such 
that 0 < lb < ub < [S'!. W_Among"“’'( 5',ii6, U) returns max{0, lb — t{£,V),ti£,V)— 
ub}, where t{£, V) = |{i | £[xi] G U}| for each tuple I G T{S). 

Example 2. Consider S = {xi,X 2 ,X 3 }, where D{xi) = D{x 2 ) = D{x 3 ) = {a,b,c,d}. 
The east returned by W_Among’'“’'( 5', 1, 2, {a, b}){£) is: 

• 0 if £ = {a,b,c,d); 

• 1 if £ = {c, d, c, d); 
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Procedure GrainmarPreCompute(5’, G) 
F[l^n^ Aq] := GrainmarMin(5’, G); 

for i := 1 to n do 
for j := i to n do 
I foreach A € N do 


F[i,j,A] :=-T; 
marked[i^ j, A] := false; 
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marA:ed[l, n, ^o] := true; 
for len := n down to 2 do 
for 7 := 1 to n — len + 1 do 
j := i len — 1 ; 

foreach {A, Ai^ A 2 ) such that {A\-^ A 1 A 2 ) € P Amarked[i,j^ A] do 
for k := i to j do 

niarked[i^ k, Ai] := true; 

F[i,k,Ai] := max{F[i, k, Ai], F[i, j, A] Q f[k + l,j,A 2 ]); 
marked[k + l,j, A 2 ] := true; 

F[k + 1, j, A 2 ] := max(i^[fc + 1, j, A 2 ], F[i, j, A] 0 f[i,k,Ai]); 


Algorithm 7: Pre-computation for W_Grammar’'“’’ 


• 2 if i= {a, b, a, b); 


Theorem 10. W_Among““’’(S', lb, ub, V) is polynomially DAG-filterable and thus tractable 
projection-safe. 


Proof. We first define two base cases U}' and uj. The function Uy is the cost function 
on Xi defined as: 


uriv) 


0 , if u e V; 
1, otherwise 


and A {v) = 1 — UY is its negation. 

We construct W_Among’^“’’ based on andt/^. Define = W_AMONG"“’'(5'i, j, j, 1^), 
where Si = {xi,..., Xi} C S. By definition, Si = S'^-iUjcCi} and So = 0- W_Among““’’(5, lb, ub, V) 
can be represented by the sub-cost functions Wg. as: 


W_Among"“’'(5',/&,u 6,T)(^) = min W} 

lb<j<ub ^ 


and each Wg. can be represented as: 


^sM) 


u;l_,m-i])(BUj{i[xi]) 

mm < • _ V 
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for i > 0 

for j > 0 and i > 0 
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Procedure GrammarProject (5, {xp}^ (v), a) 
Wp{v) := Wp{v)^a; 
for c € E do 

U^iv) := U^iv) © a; 

1_ u[p,c] := mini{7°}; 


5 foreach {A, a) such that (A >->■ a) € P do f[p,p, A] = min{/[p,p, A\,u[i,aW ; 

6 GrammarPartzalMin (S,G,p); 

7 GrammarPreCompute (S', G); 


Algorithm 8: Projection from W_Grammar’'“’’(S, G = {T,, N, P, Aq)) 


The equations form a DAG (V,E), as illustrated in Figure [5] using Example [5] In Figure 
m leaves are indicated by double-lined circles. Vertices with min or © aggregators are 
indicated by rectangles and circles respectively. The DAG has a number of vertices 
\V\ = 0(ub-n) = Oip?) and uses only using © or min as aggregations with proper scopes 
properties. By propositions [2] and [3j W_Among’'“’’ is safely DAG-filterable. Moreover, 
each leaf of the DAG is a unary cost functions. By Theorem [3 the result follows 



Figure 2: The DAG-filter corresponding to W_Among’'“'' 


□ 

Function AmongMin in Algorithmj^computes the minimum of the W_Among’'“’'(S', lb, ub, V) 
cost function according to Theorem llHI Lines [5] to 13 compute the minimum costs returned 
by each additional unary cost functions and store at the arrays u and u. Lines 13 to [3 
builds up the results by filling the table / of size n x ub according to the formulation 
stated in the proof of Theorem [T3 and return the result at line |3 The complexity is 
stated in Theorem [TT] as follows. 

Theorem 11. Function AmongMin in Algorithm\^ computes the minimum o/W_Among'"“’'(S', lb, ub, V) 
and requires 0{n{n + d)), where n = IIS'! and d is the maximum domain size. 

Proof. Lines [3 to [3 in Algorithm [3 take O(nci). Lines [3 to [3 requires 0(n • n6). Since 
is bounded by n, the result follows. □ 

Corollary 3. Given Ws = W_Among"'^’'( 5', it6, V). Enforcing GAG* on a variable 
Xi € S with respect to Ws requires 0{nd(n + d)). 
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Function AmongMin(S', lb, ub, V) 
for i = 1 to n do 

u[i] := minjC/^}; 
u[i] := miii{C/f}; 

for j = 0 to ub do f[0,j] := j ; 
for i = 1 to n do 

/[*,0] := /[* - 1,0] ©?l[z] ; 

for j = 1 to ub do 

L := niin{/[i - l,j - l]®u\i]J[i - l,j] ©71^} 

return mmib<j<ub{f[n,j]} ; 


Algorithm 9: Finding the minimum of W_Among'^“’’ 


Proof. Follow directly from Corollary [T] and Theorem [TT] . 


□ 


5.3. The W_Regular’'“'’ Cost Function 

W_Regular'““’’ is the cost function variant of the softened version of the hard con¬ 
straint Regular based on a regular language. 

Definition 22. A regular language L{M) is represented by a deterministic finite state 
automaton (DFA) M = {Q,'F,5,qo,F), where: 

• Q is a set of states; 

• T, is a set of characters; 

• The transition function 6 is defined as: S : Q x H Q; 

• qo & Q is the initial state, and; 

• F Q Q is the set of final states. 


A string t lies in L{M), written as t G L(M), iff t can lead the transitions from qo to 
qf G F in M 


The hard constraint Regular(S', M) authorizes a tuple i G L(S) if G L{M), where 
T£ is the string formed from i [^. A W_Regular’'“'' cost f unction is defined as foll ows. 
derived from the varia ble-based violation measure given by iBeldiceanu et ahl l2004l and 


van Hoeve et al. 2006 


Definition 23. Given a DFA M = { Q , E, 5, qo, F). The cost function W_Regular'““’’(S', M) 

returns min{i/(r^,Ti) | Ti G L{M)} for each tuple I G L{S), where F[{ti,T 2 ) returns the 
Hamming distance between ti and T 2 . 


Example 3. Consider S = {xi,X 2 ,Xo}, where D[xi) = {o} and D{x 2 ) = D{xo) = 

{a, b}. Given the DFA M shown in Figure\^ The cost returned by W_Regular'^“’’(S', M){tj 
is 1 if t = (a,b,a). The assignment of xo need changed in the tuple {a,b,a) so that L{M) 
accepts the corresponding string aha. 
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Figure 3: The graphical representation of a DFA. 


Theorem 12. W_Regtjlar'““’’(5', M) is polynomially DAG-filterable and thus tractable 
projection-safe. 

Proof. W_Regular’'“’’ can be represented as a DAG [13, US] , which directly gives a 
polynomial DAG-filter. In the following, we reuse the symbols Si and (v), which are 
defined in the proof of Theorem [TUI 

Define to be the cost function W_REGULAR'^“’’(S'i, AT,-), where Mj is the DFA 
(Q, E, (5, go, {Pj}))- W_Regular““’’(S', M) can be represented as: 

W.Regular™’'(S,M)(^) = min{a;^^(£)} 


The base cases are defined as: 


U!' 


So 


W = 


0 , 

T, 


if j = 0 
otherwise 


Other sub-cost functions Wg., where i > 0, are defined as follows. Define 6q^ = {{qi,v) \ 
6{qi,v) = qj}. If 5q^ = 0, no transition can lead to qj. 

3 / A. if(5,^. ^0 

{ T, otherwise 

The corresponding DAG is shown in Figure |4j based on Example [S] Again, the same 
notation as in Figure [His used. The DAG has a number of vertices |E| = 0(|S| • |(5|), 
and its leaves are unary functions. The DAG-filter is thus polynomial, and, by Theorem 
0 the result follows. □ 


Together with Theorem 6.11 bv iLee and Leun^ 2012 which showed that this global 
cost function was flow-based tractable, this result gives another reasoning for its tractabil- 
ity. TheoremfTUI also gives another proof of the tractable projection-safety of W_Among'^“’’ 
[b^ . The Among global constraint [l3| can be modeled by the Regular global con¬ 
straint [^, which the size of the corresponding DFA is polynomial in the size of the 
scope. 

Function RegularMin in AlgorithmfTUIcomDutes the minimum of a W_Regular’'“’' (S, M) 
cost function. The algorithm first initializes the table u by assigning min{?7f} to u[i,c] 
at lines ITOl and (TUI Lines ITOl to fTOl fills up the table / of the size n x \Q\. Each entry 
f[i,j] in / holds the value mining.}, which is computed according to the formulation 
stated in Theorem [TUI and returns the result at line HU] 

The time complexity of RegularMin in Algorithm [TU[ can be stated as follows. 
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Figure 4: The DAG-filter corresponding to W-Regular” 


Theorem 13. Function RegularMin in Alaorithm \l(A comvutes the minimum o/W_Regular"“’'(S', M), 
and it requires 0(nd • |(5|), where n and d are defined in TheoremMR 

Proof. Lines 11(11 and fTHl in Algorithm [TU] requires 0{n- |E|). Because |S| is bounded by d, 
the time complexity is 0{n ■ d). Lines ITOl to fTOl reauire 0{nd ■ |(5|), according to the table 
size. Line [10] requires 0{\Q\). The overall time complexity is 0{nd + nci-1(51 + 1(51) = 

0{nd-\Q\). □ 


Function RegularMin(5', M) 

1 for i := 1 to n do 

2 1^ for c S S do u[i,c\ := min{17j^'^^} ; 


3 

4 

5 

6 

7 

8 


/[ 0 , 0 ] :=0 ; 

for Qj eQ\ {go} do /[O, j] := T ; 

for i := 1 to n do 


:=T; 

foreach {qk,qj,c) such that 5{qk,c) = qj do 
L /[*>{] =min{/[*,l],/[Lfc]©u[j,c]}; 


9 return min,^.g_F{/[n, j]} ; 


Algorithm 10: Finding the minimum of W_Regular’'“’’ 


The time complexity is polynomial in the size of the scope S (with associated domains 
of size at most d), and the number of states in the finite automaton M. We state the 
time complexity of enforcing GAG* with respect to W_Regular'^“’’ as follows. 

Corollary 4. Given Ws = W_Regular^“’^(S',M). Enforcing GAG* on a variable 
Xi € S with respect to Ws requires 0{ndf -1(51)) where n and d is defined in TheoremMR 

Proof. Follow directly from Corollary |T] and Theorem [T31 □ 
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5.4- The W_Max/W_Min Cost Functions 

Definition 24. Given a function f{xi,v) that maps every variable-value pair (xi,v), 
where v e D{xi), to a cost in {0... T}. 

• The W_Max(5', /)(£), where £ G £(S'), returns ma,yi{f(xi,£[xi]) \ Xi G S'}; 

• The W_Min(S, /)(^), where i G £(S), returns mhi{f{xi,£[xi]) \ Xi G S). 

Note that the W_Max and W_Min cost functions are not a direct generalization of 
any global constraints. Therefore, their name does not follow the traditional format. 
However, they can be used to model the Maximum and Minimum hard constraints Q. 
For examples, the MAXiMUM(xmaa:, S) can be represented as Xmax = W_Max(S,/), 
where /(xi, v) = v. 

Example 4. Consider S = {xi,X 2 ,X 3 }, where D[xi) = {1,3}, D{x 2 ) = {2,4}, and 
Dixz) = {2,3}. Given f{xi,v) = 3xu, the cost of the tuple (1, 2,3) given by W_Max(S, /) 
is 9, while that o/(3,4, 2) is 12. 

Theorem 14. W_Max(S, /) and W_Min(S, /) are polynomially DAG-filterable, and 
thus tractable projection-safe. 

Proof. Definition [21] does not lead to a DAG-filter that is tractable-safe. We give another 
DAG-filter of W_Max(S, /) which satisfies polynomial DAG-filter ability, while that of 
W_Min(S, /) is similar. For the ease of explanation, we arrange all possible outputs of 
/ in a non-decreasing sequence A = [oq; cii, • ■ ■, otk], where at < aj iff i < j. The value 
tto = 0 is added into the sequence as a base case. 

We define three families of unary cost functions {Hf'}, {G“} and {F^}. Cost func¬ 
tions {id“ \ Xi G S A u € D{xi)} are unary functions on G ^ defined as 


Hfiv) 


f{xi,v), iiv = u 
T, ii V ^ u 


The unary cost functions F^^ ^ are unary cost functions on xj G S defined as: 


f 0, iiak=fixj,u) 

\ T, otherwise 


Cost functions {G}"” \ Xj G S A ak G A} are unary functions on Xj G S, defined 
recursively as: 

G“°(u) = T 

{ T, a ak > f{xj,u) a,nd\/v,f{xi,v) ^ ak 

G“'“"^('u), if ak < f{xj,u) and \/v,f {x^,v) ^ ak 

(u), Fp {u)} if ak < f{xj,u) and 3v,f{x^,v) = ak 

They give a polynomial DAG-filter for W_Max as follows. 


W_Max(S',/)(£) 


min 

ak€A/\ak=f{xi,v) 


{Hfi£[x.])® 


0 

a:jeS\{a;i} 


Grwxj])} 


(4) 
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Figure 5: The DAG-filter corresponding to W_Max 


represents the choice of the maximum cost component in the tuple, while G“ repre¬ 
sents the choice of each component other than the one with the maximum weight. 

The corresponding DAG (D, E) as shown in Figure based on Example 01 The 
notation is the same as Figure 0] The DAG contains \V\ vertices, where \V\ — 0{nd ■ 
n'^d) = 0{v?d^). By propositions [5] and [31 the decomposition is safely DAG-filterable. 
Moreover, the leaves are unary cost functions. The DAG-filter is polynomial and, by 
Theorem [71 the result follows. □ 

Example 5. Following Example^ the sequence A is defined as: 

A = [oq, oi, a2, Q!3, 04, as, as] 

= [0, /(xi, 1), /(X2,2), /(X3, 2), /(xi, 3), /(X3, 3), /(x2,4)] 

= [0,3,6,6,9,9,12] 


The DAG-filter for W_Max can be represented as: 


W_Max(5', c){£) = min < 


Hl{e[x 2 ]) © cr (£[xi]) © G“'^(£[x 3]), 
i7|(4x3]) © G“^(f[xi]) © G“^(f[x 2 ]), 
© G^ (^[X2]) © G“^ (f [X3]), 
i7|(4x3]) © G“^(f[xi]) © G“^(f[x 2 ]), 
i7|(4^2]) © Gf (f[xi]) © G“^(f[x3]), 
Hl{i[xfi) © G“^ (£[x 2 ]) © G“^ {£[x3]), 






Assume £ = (1,2,3). We first compute the values of {Hf"} and {G“}, incrementally 
starting from oq . The results are shown in Table Ql 
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Table 1: Computing the values of and {G“} 


aj 



G“^ 


do 

- 

T 

T 

T 

ai = f{xi, 1) = 3 

3 

0 

T 

T 

a2 = fix2,2) = 6 

6 

0 

0 

T 

as = f{x3,2) = 6 

T 

0 

0 

0 

a4 = f(xi,3) = 9 

T 

0 

0 

0 

as = f(x3,3) = 9 

9 

0 

0 

0 

ae = f(x2,‘i) = 12 

T 

0 

0 

0 


WJsJlAx(S,c)(i) can be computed using Table\^ which gives the cost 9. 


'W-Max{S,c){£) = min < 


T ©000, 
T ©0©0, 
9®0©0, 
T ©0® 0, 
6©0©T, 
3© T © T, 


■4 




= 9 


Function WMaxMin in Algorithm [TT] computes the minimum of a W_Max(5', c) cost 
function, based on Equation^ The one for W_Min(S', c) is similar. The for-loop at line 
M tried every possible variable-value pair (xi,a) in the non-decreasing order of f{xi,a). 
At each iteration, it first computes the minimum among all tuple £ which £[xi\ = v and 
it is the maximum weighted component in the tuple in line 1111 and update the global 
minimum in line m The variables is then updated in line [TTl They store the 

current minimum of {Gf}, which is used for compute the minimum among tuples with 
i[xi] is not the maximum weighted component. 


1 

2 

3 

4 

5 

6 

7 

8 
9 


Function WMaxMin(5, /) 

for i := 1 to n do g[xi] := T ; 
curMin := T ; 

A := {(xi, x) I Xi G S' A u € D{xi)}; 

sort A in the nondecreasing order of f{xi^v); 

foreach (xi,x) according to the sorted list A do 

curCost := H^{v) © ®j=i,„,,j^ig[xj]-, 
curMin := minjcurMin, curCost}; 
g[xi\ :=mm{g[xi],Gf{v)}; 


10 


return curMin ; 


Algorithm 11: Finding the minimum of W_Max 


The time complexity is given by the theorem below. 
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Theorem 15. Function WMaxMin in Alaorithm \ 1 1\ comvutes the minimum o/W_Max(S', /), 
and it requires 0{nd ■ log(nd)), where n and d are defined in Theorem \ll[ 

Proof. Line [TT] takes 0(nd ■ log(nd)) to sort. The for-loop at line [TT] iterates nd times. 
All operations in the iteration requires 0(1) except line [TO As it is, line [TT] requires 
0[n). By using special data structure like segment trees [1^, the time complexity can 
be reduced to 0(log(n)). The overall complexity becomes 0{nd ■ log(n(i) + nd ■ log(n)) = 
0{nd ■\og{nd)). □ 

Corollary 5. Given Ws = W_Max(5', /). Enforcing GAG* on a variable Xi G S with 
respect to Ws requires 0{ndf ■ log(nfi)), where n and d is defined in Theorem\ll[ 

Proof. Follow directly from Corollary |T] and Theorem [T5| □ 

In this section, we have seen how the minimum of a polynomially DAG-filterable 
global cost function can be computed efficiently, leading to efficient soft local consis¬ 
tency enforcement. However, each newly implemented cost function requires to build a 
corresponding DAG structure with a dedicated dynamic programming algorithm. 

In the next section, we show that, in some cases, it is also possible to avoid this by 
directly decomposing a global cost functions into a CFN in such a way that local consis¬ 
tency enforcement will emulate dynamic programming, avoiding the need for dedicated 
enforcement algorithms. 


6. Decomposing Global Cost Ftmctions into CFNs 

In GSPs, some global constraints can be efficiently represented by a logically equiv¬ 
alent subnetwork of constraints of bounded arities [lOj, [l6| , and are said to be decom¬ 
posable. Similarly, we will show that some global cost functions can be encoded as a 
sum of bounded arity cost functions. The definition below applies to any cost function, 
including constraints, extending the definition in and [iq . 

Definition 25. For a given integer p, a p-network-decomposition of a global cost 
function WJGCF[S, Ai,..., A^) is a polynomial transformation 6p that returns a CFN 
6p{S, Ai,..., Ak) = (Sue, iF, T), where S' n A = 0, such that \/Wt G .F, |T| < p and 
W G L^,W.GCFiS,A^,.. .,Akm = ITs, (f [S^]). 

Definition above allows for the use of extra variables E, which do not appear 
in the original cost function scope and are eliminated by minimization. We assume, 
without loss of generality, that every extra variable x G E is involved in at least two 
cost functions in the decompositionjj Glearly, if W-GCF{S, Ai,..., Ak) appears in a 
GFN P = (At, W, T) and decomposes into (S U E, T, T), the optimal solutions of P can 
directly be obtained by projecting the optimal solutions of the GFN P' = (X U E,yV \ 
{W.GCF{S, Ai,..., Afe)} U X, T) on A. 


^Otherwise, such a variable can be removed by variable elimination: remove x from E and replace 
the Wt involving x by the cost function min^; Wt on T \ This preserves the Berge-acyclicity of the 
network if it exists. 
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6.1. Building network-decomposable global cost functions 

A global cost function can be shown to be network-decomposable by exhibiting a 
bounded arity network decomposition of the global cost function. There is a simple 
way of deriving network-decomposable cost functions from known decomposable global 
constraints. The process goes directly from a known decomposable global constraint to a 
network-decomposable global cost function and does not require to use an intermediate 
soft global constraint with an associated violation measure p. Instead, the global cost 
function will use any relaxation of the decomposed global constraint. 

We say that the cost function Ws is a relaxation of Wg if for all i G L^,Ws{i) < 
Wg{£). We then write Ws < Wg. From a network-decomposable global constraint, it is 
possible to define an associated network-decomposable global cost function by relaxing 
every constraint in the decomposition. 

Theorem 16. Let GC{S, Ai,... ,Ak) be a global constraint that p-network decomposes 
into a classical constraint network {S U E,iF,T) and fg be a function parameterized by 
0 that maps every Ct € T to a cost function feiCr) such that feiCr) < Ct- The global 
cost function 


W.GCF{S,A,,...,A,,fg)i£)= min^ ^ f,(CT)i£'[T]) 

is a relaxation of GG{S, Ai, ..., A„), and is p-network-decomposable by construction. 

Proof. Since {S U E,iF) is a network-decomposition of GC{S, Ai,..., Ak), for any tuple 
£ G L-®, GG{S,Ai, ...,Ak){£) = 0 if and only if ram.^,^^suE= 0. 

Let £' G be a tuple where this minimum is reached. This implies that VCt G T, 

Gt{£'[T]) = 0. Since/e(CT) < Gt, fe{GT){£'[T]) = 0. Therefore /,(CT)(f[T]) = 

0 and WJJGE{S, Ai,..., A*,, fe){£) = 0. Moreover, the global cost function is p-network- 
decomposable by construction. □ 

Theorem llGI allows to immediately derive a long list of network decomposable global 
cost functions from existing network decompositions of global constraints such as AllD- 
IFFERENT, REGULAR [H^, AmONG and STRETCH [13. The parameterization through fg 
also allows a lot of flexibility. 

Example 6. Consider the softened variant W_AllDifferent'^®'^(S') of the global con¬ 
straint AllDifferent(S') constraint using the decomposition violation measure where 
the cost of an assignment is the number of pairs of variables taking the same value fh^l . 
It is well known that AllDifferent decomposes into a set of binary difference 

constraints. Similarly, the W_AllDifferent'^®'^(S') cost function can be decomposed 
into a set of ggft difference cost functions. A soft difference cost function takes 

cost 1 iff the two involved variables have the same value and 0 otherwise. In these cases, 
no extra variable is required. 

AllDifferent can be softened in a different way. Take an arbitrary graph G = 
(y, E) over V, and consider the violation measure where the cost of an assignment is 
the number of pairs of variables in E taking the same value. This gives rise to a global 
cost function W-AllDifferent-^*^ (V) that allows a zero cost assignment if and only if 
G is colorable, which is an NP-hard problem. Enforcing any soft arc consistency on 
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that single global cost function will be intractable as well since it requires to compute the 
minimum of the cost function. Instead, enforcing soft arc consistencies on the network- 
decomposition into binary cost functions will obviously be polynomial hut will achieve a 
lower level of filtering. 

7. Local Consistency and Network-Decompositions 

As we have seen with the W_ALLDiFFERENT(y, fc) global cost function, the use of 
network-decompositions instead of a monolithic variant has both advantages and draw¬ 
backs. Thanks to local reasoning, a decomposition may be filtered more efficiently, but 
this may hinder the level of filtering achieved. In CSP, it was observed that the structure 
of the decomposition has an impact on the level of consistency achieved when filtering 
the decomposition. 

Before going further, we give some extra definitions that are useful to characterize 
structure of decompositions. The hypergraph {X, E) of a CFN {X, W, T) has one vertex 
per variable Xi € X and one hyperedge for every scope S such that 3Ws G W. The 
incidence graph of a hypergraph {X,E) is a bipartite graph G = {X U E,Eh) where 
{xi,ej} G Eh iff Xi G X,ej G E and Xi belongs to the hyperedge Cj. A hypergraph 
{X, E) is Berge-acyclic iff its incidence graph is acyclic. 

In CSP, it is known that if the decomposition is Berge-acyclic, then enforcing GAC 
on the decomposition enforces GAC on the global constraint itself [7[ . We now show that 
a similar result can be obtained for cost functions using either a variant of Directional 
Arc Consistency or Virtual Arc Consistency (VAC), whose definitions are given in the 
two subsections below. 

7.1. Berge-acyclicity and directional arc consistency 

In this section, we will show that enforcing directional arc consistency on a Berge- 
acyclic network-decomposition of a cost function or on the original global cost function 
yields the same cost distribution on the last variable and therefore the same lower bound 
(obtained by node consistency) provided a correct variable ordering is used. 

Directional Arc Consistency has been originally defined on binary networks. We 
define Terminal DAC (or T-DAC) which generalizes Directional Arc Consistency [1^ by 
removing the requirement of having binary scopes. 

Definition 26 (T-DAC). Given a CFN N = (A, W, T) a total order -< over variables: 

• For a cost function Ws G a tuple i € is a full support for a value a G D{xi) 

of x,G S iff Ws{t) = 0- 

• A variable Xi G S is star directional arc consistent (DAC*) for Ws iff 

— Xi is NC*; 

— each value Vi G D{xi) has a full support I for Ws- 

• N is Terminal Directional Arc Consistent (T-DAC) w.r.t. the order -< iff for all 
cost functions Ws G W'*", the minimum variable in S is DAC* for Wg. 
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To enforce T-DAC on a cost function Ws, it suffices to first shift the cost of every 
unary cost function Wi,i S S inside Ws by applying Project (5, (a), —Wi{a)) for 

every value a G Di. Let xj be the minimum variable in S according to one can then 
apply Project (S, (b), a) for every value b € D{xj) with a = min^^j^s Ws{()- 

Let £ be a tuple where this minimum is reached. Then either a = T and the value will be 
deleted, or ^ is a full support for b G D{xj): Ws{() 0a;.g5 Wi{£[xi]) = 0. This support 

can only be broken if for some unary cost function Wi,i G S,i j, Wi{a) increases for 
some value a G D{xi). Since j is minimum, i j. 

To enforce T-DAC on a CFN {X, W, T), one can simply sort W in a decreasing order 
of the minimum variable in the scope of each cost function, and apply the previous 
process on each cost function, successively. When a cost function Ws is processed, all 
the cost functions whose minimum variable is larger than the minimum variable of S 
have already been processed, which guarantees that none of the established full supports 
will be broken in the future. Enforcing T-DAC is therefore in 0(e(i’') in time, where 
e = |W| and r = maxu/ggw |<S'|. 

Theorem 17. If a global cost function W_GCF(S', Ai,..., Ak) decomposes into a Berge- 
acyclic CFN N = [SCE^F), there exists an ordering on SUE such that the unary cost 
function Wx^^ on the last variable Xi^ of S produced by enforcing T-DAC on the sub¬ 
network (S', {W -GCE(S, Ai,..., Ak)}U{Wxi}xies) is identical to the unary cost function 
IjT. produced by enforcing T-DAC on the decomposition N = (S U E,FU {Wxi}xies)- 

Proof. Consider the decomposed network N and In = {S U E U F,Ei) its incidence 
graph. As N is Berge-acyclic we know that In is a. tree whose vertices are the variables 
and the cost functions of N. We root /jv in a variable of S. The neighbors (parent and 
children, if any) of cost functions Wt are the variables in T. The neighbors of a variable 
Xi are the cost functions involving Xj. Consider any topological ordering of the vertices 
of In- This ordering induces a variable ordering (x^j,..., Xi^),Xi^ G S which is used to 
enforce T-DAC on N. Notice that for any cost function Wt G F, the parent variable of 
Wt Ai In appears after all the other variables of T. 

Consider a value a G D{xi^) of the root. Since NC* is enforced, Wxi^{a) < T. Let 
Wt be any child of Xi„ and I a full support of value a on Wt- We have Wx^^ia) = 
®xieT AAxi{^[xi]), which proves that Wt{(.) = 0 and Vxj GT,i ^ in, Wxi{I-[xi]) = 
0. In being a tree, we can inductively apply the same argument on all the descendants 
of Xi„ until leaves are reached, proving that the assignment (xi„ = a) can be extended 
to a complete assignment with cost Wx^^ (a) in N. In both cases, Wxi^ (a) is the cost of 
an optimal extension of (xi„ = a) in N. 

Suppose now that we enforce T-DAC using the previous variable ordering on the 
undecomposed sub-network (S, {W-GCE(S, Ai,..., Ak)} U {Wxi}xiGs)- Let £ he a full 
support of value a G D{xi^) on W_GCF(S', Ai,..., A^). By definition, 114,^^ (a) = 
W-GGF{S,Ai,... ,Ak){£)@^.^gWxi{£[xi]) which proves that (a) is the cost of 
an optimal extension of (xi„ = a) on {S,{WJGGF{S,Ai,... ,Ak)} U {Wxi}xies)- By 
definition of decomposition, and since Xi„ ^ E, this is equal to the cost of an optimal 
extension of (xi„ = a) in N. □ 

T-DAC has therefore enough power to handle Berge-acyclic network-decompositions 
without losing any filtering strength, provided a correct order is used for applying EPTs. 
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In this case, T-DAC emulates a simple form of dynamic programming on the network- 
decomposition. 

Example 7. Consider the Regular ({xi,..., x„}, M) global constraint, defined by a 
(not necessarily deterministic) finite automaton M = (Q,T,,d,qo, F), where Q is a set of 
states, E the emission alphabet, 6 a transition function from T, x Q ^ 2'^, qq the initial 
state and F the set of final states. As shown in this constraint decomposes into a 
constraint network ({xi,..., x„} U {Qo: ■ • ■ 7 Qn}, C) where the extra variables Qi have Q 
as their domain. The set of constraints C in the network decomposition contains two 
unary constraints restricting Qq to {< 70 } and Qn to F and a sequence of identical ternary 
constraints each of which authorizes a triple {q,s,q') iff q' G 6{q,s), thus 

capturing 6. A relaxation of this decomposition may relax each of these constraints. The 
unary constraints on Qq and Qn would be replaced by unary cost functions Aqq and pQ„ 
stating the cost for using every state as either an initial or final state while the ternary 
constraints would he relaxed to ternary cost functions cr^g. 3 ,^^^ stating the cost for 

using any (q, s, q') transition. 

This relaxation precisely corresponds to the use of a weighted automaton M\y = 
{Q, E, A, a, p) where every transition, starting and finishing state has an associated, pos¬ 
sibly intolerable, cost defined by the cost functions A, a and p f^I . The cost of an assign¬ 
ment in the decomposition is equal, by definition, to the cost of an optimal parse of the 
assignment by the weighted automaton. This defines a W_Regular(5'. Mw) global cost 
function which is parameterized by a weighted automaton. As shown in uUl . a weighted 
automaton can encode the Flamming and Edit distances to the language of a classical 
automaton. We observe that the hypergraph of the decomposition of W_Regular is 
Berge-acyclic. Thus, contrary to the AllDifferent example, where decomposition was 
hindering filtering, T-DAC on the W_Regular network-decomposition achieves T-DAC 
on the original cost function. 

It should be pointed out that T-DAC is closely related to mini-buckets [3l| and 
Theorem [m can easily be adapted to this scheme. Mini-buckets perform a weakened 
form of variable elimination: when a variable x is eliminated, the cost functions linking 
X to the remaining variables are partitioned into sets containing at most i variables in 
their scopes and at most m functions (with arity >1). If we compute mini-buckets 
using the same variable ordering, with m = 1 and unbounded i, we will obtain the same 
unary costs as T-DAC on the root variable r, with the same time and space complexity. 
Mini-buckets can be used along two main recipes: precomputed (static) mini-buckets 
do not require update during search but restrict search to one static variable ordering; 
dynamic mini-buckets allow for dynamic variable ordering (DVO) but suffer from a lack 
of incrementality. Soft local consistencies, being based on EPTs, always yield equivalent 
problems, providing incrementality during search and are compatible with DVO. 

7.2. Berge-acyclicity and virtual arc consistency 

Virtual Arc Consistency offers a simple and direct link between CSPs and CFNs 
which allows to directly lift CSP properties to CFNs, under simple conditions. 

Definition 27 (VAC [1^). Civen a CFN N = (A, W, T), we define the constraint 
network Bool{N) as the CSP with the same set X of variables with the same domains, 
and which contains, for each cost function Ws G W, IS”! > 0, a constraint cs with the 
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same scope, which exactly forbids all tuples i G such that Ws{£) ^ 0. A CFN N is 
said to be Virtual Arc Consistent (VAC) iff the arc consistent closure of the constraint 
network Bool{N) is non empty. 

Theorem 18. If a global cost function W_GCF(S', Ai,..., Ak) decomposes into a Berge- 
acyclic CFN N = then enforcing VAC on either (SUE,iFU{Wxi}xieS, T) 

or on {S, {WJCCE{S, Ai, ..., Ak)} U {Wxi}ies^ T) yields the same lower bound W 0 . 


Proof Enforcing VAC on the CFN N = {SUE,TU {Wxi}xieS,~^) does not modify the 
set of scopes as it only performs 1-EPTs (See Definition H]) . Hence it yields an equivalent 
problem N' such that Bool{N') has the same hypergraph as Bool{N). Since N has a 
Berge acyclic structure, this is also the case for Bool{N) and Bool{N'). Now, Berge- 
acyclicity is a situation where arc consistency is a decision procedure. We can directly 
make use of Proposition 10.5 of 22|, which states that if a CFN N is VAC and Bool{N) 
is in a class of CSPs for which arc consistency is a decision procedure, N has an optimal 
solution of cost w^. 

Similarly, the network Q = {S,{W-GCF{S, Ai,..., Ak)} U {kFxijxieT, T) contains 
just one cost function with arity strictly above 1 and Bool(Q) will be decided by arc 
consistency. Enforcing VAC will therefore provide a CFN which also has an optimal 
solution of cost W^. Finally, the networks N and Q have the same optimal cost by 
definition of a decomposition. □ 


Civen that VAC is both stronger and more expensive to enforce than DAC*, the 
added value of this theorem, compared to theorem 1171 is that it does not rely on a 
variable ordering. Such order always exists but it is specific to each global cost function. 
Theorem [18] becomes interesting when a problem contains several global cost functions 
with intersecting scopes, for which theorem 1171 may produce inconsistent orders. 


8. Relation between DAG-filterability and Network-Decompositions 

In this section, we show that Berge-acyclic network-decomposable global cost func¬ 
tions are also polynomially DAC-filterable. 

Theorem 19. Let WJCCE{S,Ai,... ,Ak) he a network-decomposable global cost func¬ 
tion that decomposes into a CFN (S U E, E,T) with a Berge-acyclic hypergraph. Then 
W_CCF(S', Ai,..., Ak) is polynomially DAC-filterable. 

Proof. We consider the incidence graph of the Berge-acyclic hypergraph of the CFN 
{S U E,E,T) and choose a root for it in the original variables S, defining a rooted tree 
denoted as I. This root orients the tree I with leaves being variables in S and E. In the 
rest of the proof, we denote by I{xi) the subtree of I rooted in Xi G S Li E. Abusively, 
when the context is clear, I{xi) will also be used to denote the set of all variables in the 
subtree. 

The proof is constructive. We will transform I into a filtering DAC (actually a 
tree) of nodes that computes the correct cost min^,^j^suE 0Wj,e.F 

satisfies all the required properties of polynomial DAC-filters. To achieve this, we need 
to guarantee that the aggregation function /^ = 0 is always used on cost functions of 
disjoint scopes, that fi = min is always applied on identically scoped functions and that 
sizes remain polynomial. 
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We will be using three types of DAG nodes. A first type of node will be associated 
with every cost function Wr € in the network-decomposition. Each cost function 
appears in I with a parent variable Xi and a set of children variables among which some 
may be leaf variables. By the assumption that extra variables belong to at least two cost 
functions (see paragraph below Definition [55]) , leaf variables necessarily belong to S. We 
denote by leaf{T) the set of leaf variables in the scope T. The hrst type of node aims at 
computing the value of the cost function Wt combined with the unary cost functions on 
each leaf variable. This computation will be performed by a family of nodes C/^, where 
^ G is an assignment of non-leaf variables. Therefore, for a given cost function 

Wt and a given assignment £ of non-leaf variables, we define a DAG node with scope 
leaf{T): 


UUn = WT{£u£') 0 w,,{£'[x,]) 

XjGleaf{T) 

These nodes will be leaf nodes of the filtering DAG. Given that all cost functions in 
I have bounded arity, these nodes have an overall polynomial size and can be computed 
in polynomial time in the size of the input global cost function. 

Nodes of the second and third types are associated to every non-leaf variable Xi in I. 
For every value a G D(xi), we will have a node ujf with scope I{xi) D S. Xi may have 
different children cost functions in I and we denote by Wi the set of all the children cost 
functions of Xi in I. For each Wt G Wi, we will also have a DAG node with scope 
Si = {I{Wt) U {a;^}) D S. Notice that even if these scopes may be large (ultimately equal 
to S for wf if Xi is the root of /), these nodes are not leaf nodes of the filtering DAG and 
do not rely on an extensional definition, avoiding exponential space. 

The aim of all these nodes is to compute the cost of an optimal extension of the 
assignment £ to the subtree I[Wt) (for or I{xi) (for wf). We therefore define: 


Indeed, if 0 ;^“ computes the cost of an optimal extension to the subtree rooted in 
Wt, an optimal extension to I{xi) is just the © of each optimal extension on each child, 
since the scopes S[ do not intersect (J is a tree). The DAG node uses the © aggregation 
operator on non-intersecting scopes. 

The definition of the DAG nodes is more involved. It essentially requires: 

1. to combine the cost of Wr with the unary cost functions on leaf variables in T (this 
is achieved by Ut nodes) and costs of optimal extensions subtrees rooted in other 
non-leaf variables (this is achieved by nodes). 

2. to eliminate in this function all extra variables in the scope T except Xi if Xi G E. 
In this case, XiS value will be set in £ and eliminated on higher levels. 

If Xi G A or else if £\xi] = a, this leads to the following definition of uji^°'{£): 


mm 

T THE 

(xj^^Sv£'[xj^] = a) 


xje(T-leaf(T)-{xi}) 


( 5 ) 
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Otherwise {xi S S and ^[xi] ^ a), = T. This captures the fact that there is 

no optimal extension of i that extends {xi,a) since i is inconsistent with xi = a. 

If we consider the root variable Xi £ S of /, the wf nodes provide the cost of a best 
extension of any assignment £ (if l[xi\ = a) or T otherwise. An ultimate root DAG node 
using the aggregation operator min over all these a;“ will therefore return the optimal 
extension of £ G L'^ to all variables in I{xi), including extra variables. 

From equation [3 one can see that nodes 0 ;^“ use the aggregation operator min on 
intermediary nodes. These intermediary nodes combine the node Ut and ujj with 0 
which have non-intersecting scopes. 

Overall all those nodes form a DAG (actually a tree). In this tree, every node with the 
aggregation operation 0 is applied to operands with non-intersecting scopes, as required 
in Property [2l Similarly, every node with the min aggregation operation is applied to 
functions whose scope is always identical, as required by Property [3] Note that the 
definitions of the wf and are linear respectively in the number of children of Wt or 
Xi respectively. So, we have a filtering DAG satisfying Definition [131 CH 

For a global cost function which is Berge-acyclic network-decomposable, and therefore 
also polynomially DAG-filterable (as Theorem [T31 shows'), a natural question is which 
approach should be preferred. The main desired effect of enforcing local consistencies is 
that it may increase the lower bound W 0 . From this point of view. Theorems [17] and [18] 
give a clear answer for a single global cost function. 

• Since OSAG Q is the strongest form of arc consistency (implying also VAG), 
the strongest possible lower bound will be obtained by enforcing OSAG on the 
network-decomposed global cost function. The size of the OSAG linear program 
being exponential in the arity of the cost functions, the bounded arities of the 
network decomposed version will define a polynomial-size linear program. This 
however requires an LP solver. 

• If a network containing network-decomposed global cost functions is VAG, the 
underlying global cost functions are also VAG. As a result, good quality lower 
bounds can be obtained by enforcing VAG. These lower bounds are not as good as 
those obtained by OSAG, but VAC is usually much faster than OSAG. 

• T-DAC is otherwise extremely efficient, easy to implement, offering good lower 
bounds and incrementality for little effort. However, when several global cost func¬ 
tions co-exist in a problem, a variable order that is a topological sort of all these 
global cost functions may not exist. In this case, using a topological order for each 
scope independently would lead to the creation of cycles leading to possibly infinite 
propagation. It may then be more attractive to use filtering DAGs to process these 
cost functions. 

Finally, it should be noted that Theorem [171 only guarantees that T-DAC on a global 
cost function or its topologically sorted Berge-acyclic network-decomposition provide 
the same bound contribution. If a consistency stronger than DAG* is enforced (such as 
FDAC* or ED AC*), it may be more powerful when enforcedt on the global cost function 
itself than on its network-decomposition, thus giving an advantage to filtering DAGs. 

In the end, the only truly informative answer will be provided by experimental results, 
as proposed in Section [3] 
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9. Experiments 


In this section, we put theory into practice and demonstrate the practicality of the 
transformations described in the previous sections in solving over-constrained and opti¬ 
mization problems. We implemented cost functions with our transformations in toulbar2 
vO.9.^. For each cost function used in our benchmark problems, we impkmented weak 
Existential Directional Generalized Arc Consistency (EDGAC*) [29|, l49|, , a local con¬ 

sistency combining AC, DAC and EAC, using DAG-filtering (called DAG-hased approach 
in the sequel) with pre-computed tables (as described in Section |3]). When possible, 
we also implemented a Berge-acyclic network-decomposition to be propagated using 
EDGAC* (called network-based approach). We ignore weaker forms of local consistency 
such as Arc Consistency or 0-inverse consistency |68j as previous experiments with global 
cost functions have shown that these weak local consistencies lead to much less efficient 
solving (^ . 

In the experiments, we used default options for toulbar2, including a new hybrid best- 
first search strategy introduced in @ , which finds good solutions more rapidly compared 
to classical depth-first search. The default variable ordering strategy is dom/wdeg [ 2 ^ 
with Last Conflict while the default value ordering consists, for each variable, in 
choosing first its fully supported value as defined by EDGAC*. At each node during 
search, inclu ding the root node, we eliminate dominated values using Dead End Elimina- 
^ and we eliminate all variables having degree less than two using 
ISl 1^ . At the root node only, this is improved by pairwise decom- 


tion pruning 0) 
variable elimination 

position 33 and we also eliminate all variables having a functional or bijective binary 
relation {e.g., an equality constraint) with another variable. The tests are conducted on 
a single core of an Intel Xeon E5-2680 (2.9GHz) machine with 256GB RAM. 

We performed our experiments on four different benchmark problems. For the two 
first benchmarks (car sequencing and nonogram), we have a model with Berge-acyclic 
network-decompositions, whereas for the two others (well-formed parentheses and mar¬ 
ket split), we do not. Each benchmark has a 5-minute timeout. We randomly generate 
30 instances for each parameter setting of each benchmark. We first compare the num¬ 
ber of solved instances, i.e. finding the optimum and proving its optimality (no initial 
upper bound). We report the average run-time in seconds and consider that an unsolved 
problem requires the maximum available time (timeout). When all instances are solved, 
we also report the average number of backtracks (or otherwise). The best results are 
marked in bold (taking first into account the number of solved instances in less than 5 
minutes and secondly CPU time). 


9.1. The Car Sequencing Problem 

The car sequencing problem (probOOl in CSPLib, [H^) requires sequencing n cars 
of different types specified by a set of options. For any subsequence of ct consecutive 
cars on the assembly line, the option Oi can be installed on at most of them. This 
is called the capacity constraint. The problem is to find a production sequence on the 
assembly line such that each car can be installed with all the required options without 
violating the capacity constraint. We use n variables with domain I to n to model this 


^ http://www.inra.fr/mia/T/toulbar2/ 
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problem. The variable Xi denotes the type of the car in the sequence. One GCC 
(global cardinality 53|) constraint ensures all cars are scheduled on the assembly line. 
We post n — Ci + 1 Among constraints 12| for each option Oi to ensure the capacity 
constraint is not violated. We randomly generate 30 over-constrained instances, each of 
which has 5 possible options, and for each option Oi, rrii and Ci are randomly generated 
in such a way that 1 < rrii < Ci < 7. Each car in each instance is randomly assigned 
to one type, and each type is randomly assigned to a set of options in such a way that 
each option has 1/2 chance to be included in each type. To introduce costs, we randomly 
assign unary costs (between 0 to 9) to each variable. 

The problem is then modeled in three different ways. The first model is obtained by 
replacing each Among constraint by the W_Among’'“’’ cost function and the GCC 
constraint by the W_GCC’'“’’ cost function. W_Among"“’' returns a cost equal to 
the number of variables that need to be re-assigned to satisfy the Among constraint. 
W_GCC’'“’’ is used as a global constraint and returns T on violation [^. This model 
is called “flow&DAG-based” approach in Table [5] 

The second model, identified as “DAG-based” in Table [21 uses a set of W_Among"“’' 
cost functions to encode GCC, i.e. replacing the single global cost function exploiting a 
flow network by a set of DAG-based global cost functions Q . 

In the third model, identified as “network-based” in Table|21 each of the W_Among"“” 
in the previous DAG-based model is decomposed into a set of ternary cost functions with 
extra variables as described in Section [51 

Table|2]gives the experimental results. Column n' indicates the sum of the number of 
original variables (n) and the number of extra variables added in the network-based ap¬ 
proach. Column n" gives the total number of unassigned variables after pre-processing. 
We observe that the network-based approach performed the worst among the three ap¬ 
proaches. The DAG-based approach is up to six times faster than the flow&DAG-based 
approach on completely solved instances (n < 13) and solves more instances within 
the 5-minute time limit. Surprisingly, it also develops the least number of backtracks 
on completely solved instances. We found that the initial lower bound produced by 
weak EDGAC on the flow&DAG-based approach can be lower than the one produced 
by the DAG-based approach. This is due to different orders of EPTs done by the two 
approaches resulting in different lower bounds. Finding an optimal order of integer arc- 
EPTs is NP-hard [^. Recall that EDGAC has a chaotic behavior compared to OSAC 
or VAC and encoding GCC into a set of W-Among”®” will produce more EPTs (each 
W_Among'"“’' moving unary costs differently) creating new opportunities for the over¬ 
lapping W-Amongs”®” to deduce a better lower bound. 


9.2. The Nonogram Problem 

The nonogram problem (prob012 in CSPLIB [^) is a typical board puzzle on a 
board of size p x p. Each row and column has a specified sequence of shaded cells. For 
example, a row specified (2, 3) contains two segments of shaded cells, one with length 2 
and another with length 3. The problem is to find out which cells need to be shaded such 
that every row and every column contain the specific sequence of shaded cells. We model 
the problem hy n = p^ variables, in which Xij denotes whether the cell at the row 
and column needs to be shaded. In the experiments, we generate random instances 
from perturbed white noise images. A random solution grid, with each cell colored with 
probability 0.5, is generated. A feasible nonogram problem instance is created from the 
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Table 2: Car sequencing problem (timeout=5min). For each approach, we give the number of instances 
solved (#), the mean number of backtracks only if all the instances have been completely solved (bt.), 
and the mean CPU time over all the instances (in seconds). 


n 

flow&DAG-based 

DAG-based 

network-based 

# 

bt. 

time 

# 

bt. 

time 

n' 

n" 

# 

bt. 

time 

8 

30 

19.7 

0.10 

30 

13.6 

0.03 

154 

102 

30 

210.9 

0.11 

9 

30 

58.1 

0.31 

30 

36.4 

0.09 

198 

135 

30 

798.5 

0.41 

10 

30 

109.9 

0.88 

30 

82.1 

0.21 

245 

170 

30 

3,372 

2.0 

11 

30 

193.2 

2.1 

30 

156.7 

0.50 

293 

206 

30 

17,286 

12.2 

12 

30 

522.0 

8.0 

30 

306.1 

1.4 

344 

245 

29 

- 

90.5 

13 

30 

1,251 

22.6 

30 

963.1 

4.9 

396 

285 

10 

- 

233.5 

14 

26 

- 

86.4 

30 

3,227 

20.4 

451 

328 

2 

- 

280.3 

15 

17 

- 

160.4 

29 

- 

72.1 

507 

372 

2 

- 

283.8 

16 

12 

- 

204.9 

23 

- 

111.8 

566 

419 

1 

- 

297.3 


lengths of the segments observed in this random grid. To make it infeasible, for each row 
and each column, the list of segment lengths is randomly permuted, i.e., its elements are 
shuffled randomly. If a list is empty, then a segment of random length I is added (0 < ( < 
p). We model and soften the restrictions on each row and column by W_Regular’'“’’, 
resulting in three models: flow-based, DAG-based, and network-based. The flow-based 
model uses the W_Regular’'“’' implementation based on minimum cost flows described 
in , the DAG-based version uses the filtering DAG (see Q for implementation details), 
and the network-based version uses the decomposition presented in Example [T] 

Table [3] shows the results of the experiments. For medium-size problems {p < 9, 
n < 81), the network-based approach develops the least number of backtracks on average 
compared to the two other approaches. Value and variable elimination at pre-processing 
reduces the number of variables by a factor greater than two. The flow-based and DAG- 
based approaches develop the same number of backtracks, producing the same EPTs, 
but the dynamic programming algorithm implemented in the DAG-based approach is 
about one order-of-magnitude faster than the minimum cost flow algorithm used in the 
flow-based approach. Moreover, the network-based approach is at least one order-of- 
magnitude faster than the DAG-based approach. On the largest instances, because of an 
exponential increase of the number of backtracks, the network-based approach becomes 
unable to solve all the instances in less than five minutes, but still outperforms the other 
two approaches. 

9.3. The Well-formed Parentheses problem 

In this experiment, we use a network-decomposition of the W_Grammar constraint 
whose structure is depicted in Figure[6l It is obviously not Berge-acyclic. This experiment 
will allow us to see the behavior of network-decompositions when they are not Berge- 
acyclic. 

Given a set of 2p even length intervals within [1,..., 2p], the well-formed parentheses 
problem is to find a string of parentheses with length 2p such that substrings in each of 
the intervals are well-formed parentheses. We model this problem by a set of n = 2p 
variables. Domains of size 6 are composed of three different parenthesis types: ()[]{}■ We 
post a W_Grammar’'“’’ cost function on each interval to represent the requirement of 
well-formed parentheses. We generate 2p — 1 even length intervals by randomly picking 
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Table 3: Nonogram (timeout=5min). For each approach, we give the number of instances solved (#), 
the mean number of backtracks only if all the instances have been completely solved (bt.), and the mean 
CPU time over all the instances (in seconds). 


n 

flow-based 

DAG-based 

network-based 


# 

bt. 

time 

# 

bt. 

time 

n' 

n" 

# 

bt. 

time 

36 

30 

11.4 

0.09 

30 

11.4 

0.01 

96 

18 

30 

4.4 

0.00 

49 

30 

41.8 

0.29 

30 

41.8 

0.05 

133 

42 

30 

22.5 

0.01 

64 

30 

186.4 

2.3 

30 

186.4 

0.26 

176 

64 

30 

90.3 

0.01 

81 

30 

254.4 

4.5 

30 

254.4 

0.50 

225 

97 

30 

248.9 

0.04 

100 

25 

- 

86.0 

30 

3,581 

10.8 

280 

131 

30 

3,861 

0.47 

121 

19 

- 

166.8 

26 

- 

72.8 

341 

171 

30 

12,919 

1.6 

144 

3 

- 

279.6 

9 

- 

233.2 

408 

224 

28 

- 

44.1 

169 

0 

- 

300.0 

5 

- 

266.5 

481 

267 

23 

- 

116.4 

196 

0 

- 

300.0 

1 

- 

297.1 

560 

330 

7 

- 

257.1 


their end points in [1,... ,2p\, and add an interval covering the whole range to ensure 
that all variables are constrained. We also randomly assign unary costs (between 0 and 
10 ) to each variable. 

We compare two models. The first model, the DAG-based approach, is obtained by 
modeling each W_Grammar’'“’’ cost function using a filtering DAG approach. 

In the second network-based model, we decompose each W_Grammar’'“'' cost func¬ 
tion involving m variables using m{m + l)/2 extra variables Pij (1 < j < m, 1 < i < 
m — j + 1) whose value corresponds to either a symbol value (for j = 1) or a pair of a 
symbol value S and a string length k {1 < k < j, for j > 2) associated to the substring 
{i,i + j — 1), starting from i of length j. Ternary cost functions link every triplet Pij, 
Pi^k, Pi+k,j-k so that there exists a compatible rule S->AB in order to get the substring 
{i, i+j — 1) from the two substrings {i,i + k — 1) and {i + k,i + j — 1) when Pij = (S, k), 
Pi,k = (A, u), Pi+k,j-k = {B, v) with u < k, V < j — k. Binary cost functions are used to 
encode the terminal rules between Pi i (i £ [l,m]) and the original variables. 


Table 4: Soft well-formed parentheses (timeout=5min). For each approach, we give the number of 
instances solved (#), the mean number of backtracks only if all the instances have been completely 
solved (bt.), and the mean CPU time over all the instances (in seconds). 


n 

DAG-based 

network-based 


# 

bt. 

time 

n' 

n" 

# 

bt. 

time 

8 

30 

3.5 

0.06 

145 

131 

30 

676.8 

0.21 

10 

30 

6.6 

0.60 

250 

228 

30 

63,084 

12.5 

12 

30 

9.1 

3.8 

392 

361 

7 

- 

260.3 

14 

30 

21.1 

8.3 

580 

538 

0 

- 

300 

16 

29 

- 

48.9 

841 

785 

0 

- 

300 

18 

23 

- 

115.6 

1,146 

1,075 

0 

- 

300 


Results are shown in Table 01 The network-based approach is clearly inefficient. It 
has n' = 1,146 variables on average for p = 9 (n = 18). The number of backtracks 
increases very rapidly due to the poor propagation on a non Berge-acyclic network. The 
DAG-based approach clearly dominates here. Notice that the DAG-based propagation 
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Figure 6: Network associated to the decomposition of W_Grammar““''(xi, X2, X3,0:4, xs, aie). 


of W_Grammar’'“’' can be very slow with around 1 backtrack per second for p = 9. 

As a second experiment on well-formed parentheses, we generate new instances using 
only one hard global grammar constraint and a set of p(2p — 1) binary cost functions 
corresponding to a complete graph. For each possible pair of positions, if a parentheses 
pair (0, [], or {}) is placed at these specific positions, then it incurs a randomly-generated 
cost (between 0 to 10). A single W_Grammar"“'’ cost function is placed on all the 
n = 2p variables, which returns T on violation (a Grammar constraint), ensuring that 
the whole string has well-formed parentheses. As in the experiments of Table SI the two 
models are characterized by how the consistency is enforced on the W_Grammar"“’' 
cost function: a filtering DAG for the DAG-based approach, a network-decomposition 
for the network-based approach. 

Results are shown in Table [5] The network-based approach still develops more back¬ 
tracks on average for p > 6 (n > 12) than the DAG-based approach but the difference is 
less important than in the previous experiment because there is a single grammar con¬ 
straint. Surprisingly, for p < 5, the network-based approach develops less backtracks than 
the DAG-based approach. The network-based approach benehts from variable elimina¬ 
tion that exploits bijective binary relations occurring in the decomposed hard grammar 
cost function. Moreover, having only one global constraint implies less extra variables for 
the network-based approach than in the previous experiment {n' = 189 for p = 9 instead 
of n' = 1,146). The propagation speed of the network-based approach is much better 
than the DAG-based approach, with ^4,100 bt./sec instead of ^23 bt./sec for p = 9, 
resulting in better overall time efficiency compared to the DAG-based approach, being 
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up to 8 times faster for p = 7 to solve all the thirty instances. 


Table 5: Well-formed parentheses (single hard global constraint) with additional binary cost functions 
(timeout=5min). For each approach, we give the number of instances solved (#), the mean number of 
backtracks if available (bt.), and the mean CPU time (in seconds). 


n 

DAG-based 

network-based 


# 

bt. 

time 

n' 

n” 

# 

bt. 

time 

8 

30 

37.4 

0.11 

44 

33 

30 

19.5 

0.04 

10 

30 

105.5 

0.51 

65 

51 

30 

100.5 

0.12 

12 

30 

265.4 

2.4 

90 

73 

30 

916.5 

0.38 

14 

30 

887.7 

14.0 

119 

99 

30 

6,623 

1.7 

16 

30 

3,037 

80.6 

152 

129 

30 

54,544 

12.0 

18 

13 

- 

257.9 

189 

163 

30 

394,391 

95.7 


9.4- The Market split problem 

In some cases, problems may contain global cost functions which are not network- 
decomposable because the bounded arity cost function decomposition is not polynomial 
in size. However, if the network is Berge-acyclic, Theorem [T7] still applies. With ex¬ 
ponential size networks, filtering will take exponential time, but may yield strong lower 
bounds. The global constraint X]r=i = h (a and b being integer coefficients) can 
be easily decomposed by introducing n — 3 intermediate sum variables qi and ternary 
sum constraints of the form + aiXi = qi with i G [3,n — 2] and aiXi + 02 X 2 = 92 , 
qn -2 + o-n-iXn-i+ cLnXn = h. More generally, ternary decompositions can be built for the 
more general case where the right hand side of the constraint uses any relational operator, 
including any Knapsack constraint. In this representation, the extra variables qi have 
b values in their domain, which is exponential in the size of the representation of b (in 
log(6)). As for the pseudo-polynomial Knapsack problem, if b is polynomially bounded 
by the size of the global constraint, propagation will be efficient. It may otherwise be 
exponential in it. 

As an example, we consider a g eneralized version of the Knapsack problem, the 
Market Split problem defined in |26l . l66 |. The goal is to minimize such that 

Sr=i “bt-®* = bj for each j G and Xi are Boolean variables in {0,1} (o, a and 

b being positive integer coefficients). We compared the Berge-acyclic decomposition 
in toulbar2 (version 0.9.8) with a direct application of the Integer Linear Programming 
solver cplex (version 12.6.3.0). We used a depth-first search with a static variable ordering 
(in decreasing order) and no pre-processing (options -hbfs: -svo -0 -nopre) for 

toulbar2. We generated random instances with random integer coefficients in [0,99] for 
o and a, and bj = ^i,j\■ We used a sample of 30 problems with m = 4, n = 30 

leading to maxbj = 918. The mean number of nodes developed in toulbar2 was 29% 
higher than in cplex, which was on average 4.5 times faster than toulbar2 on these 
problems. The 0/1 knapsack problem probably represents a worst case situation for 
toulbar2, given that cplex embeds much of what is known about 0/1 knapsacks (and 
only part of these extend to more complicated domains). Possible avenues to improve 
toulbar2 results in this unfavorable situation would be to use a combination of the m 
knapsack constraints into one as suggested in (6^ . 
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10. Conclusion 


Existing tools for solving optimization on graphical models are usually restricted to 
cost functions involving a reasonably small set of variables, often using an associated 
cost table. But problem modeling may require to express complex conditions on a non- 
bounded set of variables. This has been solved in Constraint Programming by using 
Global Constraints. Our results contribute to lift this approach to the more general 
framework of cost function networks, allowing to express and efficiently process both 
global constraints and global cost functions, using dedicated soft arc consistency filtering. 

Our contributions are four-fold. First, we define the tractability of a global cost 
function, and study its behavior with respect to projections/extensions with different 
arities of cost functions. We show that tractable r-projection-safety is always possible for 
projections/extension to/from the nullary cost function, while it is always impossible for 
projections/extensions to/from r-ary cost functions for r >2. When r = 1, we show that 
a tractable cost function may or may not be tractable 1-projection-safe. Second, we define 
polynomially DAG-filterable cost functions and show them to be tractable 1-projection- 
safe. We give also a polytime dynamic programming based algorithm to compute the 
minimum of this class of global cost functions. We also show that the cost function 
W_GRAMMAR'““’'is polynomially DAG-filterable and tractable 1-projection-safe. The 
same results applies to W_Among"“’', W_Regular'"“’", and W_Max/W_Min as shown 
in the associated technical report Q. Third, we show that dynamic programming can be 
emulated by soft consistencies such as DAG and VAC if a suitable network decomposition 
of the global cost function into a Berge-acyclic network of bounded arity cost functions 
exists. In this case, local consistency on the decomposed network is essentially as strong 
as on the global cost function. This approach is shown to be a specific case of the previous 
approach in the sense that any Berge-acyclic network-decomposable cost function is also 
polynomially DAG-filterable. Finally, we perform experiments and compare the DAG- 
based and network-based approaches, in terms of run-time and search space. The DAG- 
based approach dominates when there are several overlapping global cost functions. On 
the contrary, the network-based approach performs better if there are few global cost 
functions resulting in a reasonable number of extra variables. This is complexified by 
additional techniques such as boosting search by variable elimination [i^. Weighted 
Degree heuristics |20l |. and Dead-End Elimination [s^ which work better with the low- 
arity cost functions of the network-based approach. We also compare against the flow- 
based approach and show that our approaches are usually more competitive. On 
Berge acyclic network-decomposable cost function just as W_Regular'^“’’ , this is not 
unexpected as the dynamic programming based propagation or its emulation by T-DAC 
essentially solves a shortest path problem, which can easily be reduced to the more 
general min-cost flow problem used in [H^ which can itself be reduced to LP [l[. As 
problems become more specific, algorithmic efficiency can increase. 

An immediate possible future work is to investigate other sufficient conditions for 
polynomially DAG-filterable and also tractable 1-projection-safety. Our results only 
provide a partial answer. Whether there exists necessary conditions for polynomially 
DAG-filterable is unknown. Besides polynomially DAG-filterable, we would like to in¬ 
vestigate other form of tractable 1-projection-safety and techniques for enforcing typical 
consistency notions efficiently. 
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1 Introduction 

This companion manuscript should be read in conjunction with the paper of Allouche, Bessiere, Boizumault, 
de Givry, Guttierez, Lee, Leung, Loudni, Metivier, Schiex, and Wu (2015), which contains the necessary 
definitions, notations and theorems. 


2 Polynomial DAG-Decomposable Global Gost Functions 

In the following, we show that W_Among^“’',W_Regular’'“’’, W_Max, and W_Min are Polynomially 
DAG-Decomposable and thus Tractable Projection-Safe. For simplicity, we assume the scope of each global 
cost function to be S' = {xi, ..., Xn}- 

2.1 The W_Among™'' Cost Function 

W-Among’''*’’ is the cost function variant of the softened version of Among using the corresponding variable- 
based violation measure (Solnon, Cung, Nguyen, & Artigues, 2008). 

Definition 1. (Solnon et at, 2008) Given a set of values V, a lower bound lb and an upper bound ub 
such that 0 < lb < ub < |S|. W_Among"“'’(S, u 6 , U) returns max{0,lb — t{£,V),ti^,V) — ub}, where 
t{£,V) = \{i I € V}\ for each tuple £ G D{S). 

Example 1. Consider S = {a;i, a; 2 , X 3 }, where D(xi) = D(x 2 ) = D{x^) = {a,b,c,d}. The cost returned by 
W_AMONG’'“yS, l,2 ,{a,6 })(U is: 

• 0 if £ = {a, b, c, d); 

• 1 if £ = {c, d, c, d); 

• 2 if £ = {a, b, a, b); 

Theorem 1 . W_Among"“’'(S,/ 6 , U) is polynomially DAG-decomposable and thus tractable projection- 
safe. 

Proof. We first define two base cases U)' and uj. The function Ug is the cost function on Xi defined as: 

U'^ (v) = I Av €V-, 

® 1 otherwise 


and C/j (w) = 1 — UY is its negation. 
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We construct W_Among““’’ based on and C/^. Define = W_AMONG““’’(5'i, j, j, D), where Si = 
{xi,..., Xi} C S. By definition, Si = Si-i U {xi\ and Sq = 0. W_Among’'“’'(S', lb, ub, V) can be represented 
by the sub-cost functions ojg. as: 

W_Among"“’'(5',/&,u6,D)(^) = min {wi W} 

lb<j<ub ^ 

and each ujg. can be represented as: 

The equations form a DAG {V,E), as illustrated in Figure 1 using Example 1. In Figure 1, leaves are 
indicated by double-lined circles. Vertices with min or © aggregators are indicated by rectangles and circles 
respectively. The DAG has a number of vertices \V\ = 0{ub ■ n) = 0{v?) and uses only using © or min as 
aggregations with proper scopes properties. By Theorem 6 and 7 by Allouche et al. (2015), W_Among’^“’’ 
is safely DAG- decomposable. Moreover, each leaf of the DAG is a unary cost functions. By Definition 21 
from Allouche et al. (2015), W_Among"“’' is therefore polynomially DAG-decomposable. The result follows 
by Theorem 9 by Allouche et al. (2015). 



Figure 1: The DAG corresponding to W_Among’'“’’ 

□ 

Function AmongMin in Algorithm 1 computes the minimum of the W_Among*'“'’(S', lb, ub, V) cost function 
according to Theorem 1. Lines 1 to 3 compute the minimum costs returned by each additional unary cost 
functions and store at the arrays u and u. Lines 4 to 8 builds up the results by filling the table / of size 
n X ub according to the formulation stated in the proof of Theorem 1, and return the result at line 9. The 
complexity is stated in Theorem 2 as follows. 

Theorem 2. Function AmongMin in Algorithm 1 computes the minimum of W_Among^°'’'{S, lb, ub,V) and 
requires 0(n(n + d)), where n = [S'] and d is the maximum domain size. 

Proof. Lines 1 to 3 in Algorithm 1 take 0{nd). Lines 4 to 9 requires 0(n • ub). Since ub is bounded by n, the 
result follows. □ 

Corollary 1. Given Ws = W_Among““’^(S', 16, V). Function findSupport{) in Algorithm 3 from Al¬ 

louche et al. (2015) requires 0(nd(n + d)). 

Proof. Follow directly from Property 2 in Allouche et al. (2015) and Theorem 2 . □ 


= ujl_^ii[Si.,])(BUAn^i]) 

. r u;^g-\m_,])(Bur{i[xi]) 

= mm < _ V 

\ u;i_^m-i])(BUi ii[xi]) 


for i > 0 


for j > 0 and i > 0 
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Function AmongMin(5',/&, F) 

1 for i = 1 to n do 

2 m[z] := min{C/^}; 

3 M[i] := min{[/j'^}; 

4 for j = 0 to ub do f[0,j] := j ; 

5 for i = 1 to n do 

6 /[i,0] :=/[i - 1,0] ©m[i] ; 

7 for j = 1 to ub do 

9 return minih<j<„b{/[n,j]} ; 

Algorithm 1: Finding the minimum of W_Among’'“’’ 

2.2 The W_Regular^“^ Cost Function 

W_Regular’'“'’ is the cost function variant of the softened version of the hard constraint Regular (Pesant, 
2004) based on a regular language. 

Definition 2. A regular language L{M) is represented by a deterministic finite state automaton (DFA) 
M = {Q,T,,S,qo,F), where: 

• Q is a set of states; 

• Ft is a set of characters; 

• The transition function 6 is defined as: S : Q x F Q; 

• qo ^ Q is the initial state, and; 

• F C Q is the set of final states. 

A string r lies in L{M), written as t G L{M), iff t can lead the transitions from qo to qf G F in M 

The hard constraint Regular(5', M) authorizes a tuple £ G D{S) if G L{M), where Ti is the string 
formed from £ (Pesant, 2004). A W_Regular’'“’' cost function is defined as follows, derived from the 
variable-based violation measure given by Beldiceanu, Carlsson, and Petit (2004) and van Hoeve, Pesant, 
and Rousseau (2006). 

Definition 3. (Beldiceanu et al., 2004; I’an Hoeve et al, 2006) Given a DFA M = {Q,F,S,qo, F). The 
cost function W_Regular’'“’'(5', M) returns m\n{H(Ti,Ti) \ n G L{M)} for each tuple £ G D{S), where 
H(ti,T 2 ) returns the Hamming distance between ri and 72. 



Figure 2: The graphical representation of a DFA. 

Example 2. Consider S = {xi, X 2 , X 3 }, where D(xi) = {a} and D(x 2 ) = D(x^) = {a, 6 }. Given the DFA 
M shown in Figure 2. The cost returned by W_Regular"“’'(S', M)(f) is 1 if £ = (a,b,a). The assignment 
of xz need changed in the tuple (a,b,a) so that L(M) accepts the corresponding string aba. 
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Theorem 3. W_Regular’'“’'(S', M) is polynomially DAG-decomposahle and thus tractable projection-safe. 

Proof. W_Regular*'“’' can be represented as a DAG (Beldiceanu et al., 2004; van Hoeve et al., 2006; Lee & 
Leung, 2012), which directly gives a polynomial DAG-decomposition. In the following, we reuse the symbols 
Si and UY{v), which are defined in the proof of Theorem 1. 

Dehne to be the cost function W_REGULAR"“’'(S'i, Mj), where Mj is the DFA (Q, S, 5, go, {^j}))- 
W_Regular’'“’'(S', M) can be represented as: 

W_Regular““’'(S',M)(£) = min { u }^ sj £)} 


The base cases ujg^ are defined as: 


^So 




0 , ifj = 0 
T, otherwise 


Other sub-cost functions w^., where i > 0, are defined as follows. Define Sq^ = {(qi,v) | S{qi,v) = qj}. If 
6q^ = 0, no transition can lead to qj. 






min {ws,_i(^[S'*-i]) © if <5,, Y 0 

o{qk,v)=qj 

T, otherwise 


The corresponding DAG is shown in Figure 3, based on Example 2. Again, the same notation as in 
Figure 1 is used. The DAG has a number of vertices \V\ = 0(|S'| • |Q|), and its leaves are unary functions. The 
DAG-decomposition is thus polynomial, and, by Theorem 9 by Allouche et al. (2015), the result follows. □ 

Together with Theorem 6.11 by Lee and Leung (2012) which showed that this global cost function was 
flow-based tractable, this result gives another reasoning for its tractability. Theorem 3 also gives another 
proof of the tractable projection-safety of W_Among'““’’ (Solnon et ah, 2008). The Among global constraint 
(Beldiceanu & Gontejean, 1994) can be modeled by the Regular global constraint (Pesant, 2004), which 
the size of the corresponding DFA is polynomial in the size of the scope. 



Figure 3: The DAG corresponding to W_Regular’'“’’ 

Function RegularMin in Algorithm 2 computes the minimum of a W_Regular’'“’'(S', M) cost function. 
The algorithm first initializes the table u by assigning min{[/f} to u[i, c] at lines 1 and 2. Lines 3 to 8 fills up 
the table / of the size nx \Q\. Each entry f[i,j] in / holds the value min{a;g.}, which is computed according 
to the formulation stated in Theorem 3, and returns the result at line 9. 

The time complexity of RegularMin in Algorithm 2 can be stated as follows. 

Theorem 4. Function RegularMin in Algorithm 2 computes the minimum o/W_R egular'““’’(S, M), and 
it requires 0{nd ■ IQI), where n and d are defined in Theorem 2. 
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Proof. Lines 1 and 2 in Algorithm 2 requires 0{n ■ |S|). Because |S| is bounded by d, the time complexity 
is 0{n ■ d). Lines 3 to 8 require 0{nd ■ |Q|), according to the table size. Line 9 requires 0(|(5|). The overall 
time complexity is 0{nd + nd ■ \Q\ + |(5|) = 0{nd ■ |(5|). □ 


Function RegularMin(5', M) 

1 for i := 1 to n do 

2 1^ for c e S do u[i,c] := minju/'^^} ; 

3 /[0,0]:=0; 

4 for qj €Q\ {go} do f[0,j] := T ; 

5 for i := 1 to n do 

6 /[bj]:=T; 

7 foreach {qk,qj,c) such that S{qk, c) = qj do 

8 L /[bi] = ™in{/[bi]./[b^] ® w[bc]}; 

9 return min,^.gF{/[n,i]} ; 

Algorithm 2: Finding the minimum of W_Regular’'“’’ 

We state the time complexity of enforcing GAC* with respect to W_Regular"“’' as follows. 

Corollary 2. Given Ws = W_Regular’'“'’(S', M). Function findSupportf) in Algorithm 3 from Allouche 
et al. (2015) requires 0{nd^ ■ |(5|), where n and d is defined in Theorem 2. 

Proof. Follow directly from Property 2 in Allouche et al. (2015) and Theorem 4. □ 

The time complexity is polynomial in the size of the input {S, M) which includes the scope of size n (with 
associated domains of size at most d) and the finite automata M , including Q. 


2.3 The W_Max/W_Min Cost Functions 

Definition 4. Given a function f{xi,v) that maps every variable-value pair {xi,v), where v € D(xi), to a 
cost m (0 ... T}. 

• The W_Max(S', /)(f), where t G T){S), returns max{/(xi, | Xi G S'}; 

• The W_Min(S, /)(f), where £ G D{S), returns mhi{f{xi,£[xi]) \ Xi G S}. 

Note that the W_Max and W_Min cost functions are not a direct generalization of any global constraints. 
Therefore, their name does not follow the traditional format. However, they can be used to model the 
Maximum and Minimum hard constraints (Beldiceanu, 2001). For examples, the Maximum( ccmaa;j S) can 
be represented as Xmax = W_Max(S, /), where f{xi,v) = v. 

Example 3. Gonsider S = {xi,X 2 ,X 3 }, where D(xi) = {1,3}, D(x 2 ) = {2,4}, and D^xs) = {2,3}. Given 
f{xi,v) = 3 XV, the cost of the tuple (1,2,3) given by W_Max(S, /) is 9, while that o/(3,4, 2) is 12. 

Theorem 5. W_Max(S, /) and W_Min(S, /) are polynomially DAG-decomposable, and thus tractable 
projection-safe. 

Proof. Decomposing W_Max(S, /) and W_Min(S, /) directly using Definition 4 does not lead to a tractable- 
safe DAG-decomposition. We give a polynomial DAG-decomposition of W_Max(S, /) only, since that of 
W_Min(S, /) is similar. For the ease of explanation, we arrange all possible outputs of / in a non-decreasing 
sequence A = [ooi oi, ■. ■, Ofc], where Oi < aj iff f < j. The value oq = 0 is added into the sequence as a base 
case. 

We define three familys of unary cost functions {Hf}, {Gf} and {F!f}- Gost functions {iL“ | Xi G S Au G 
D{xi)} are unary functions on G S' defined as 



f{xi,v), Av = u 
T, a V ^ u 
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The unary cost functions ^ are unary cost functions on xj G S defined as: 

/ 0 : iiak = f{xj,u) 

^ ^ IT, otherwise 

Cost functions {G'“'‘ \ Xj £ S /\ak & A} are unary functions on Xj G S, defined recursively as: 
G““(w) = T 

{ T, if ak > f{xj,u) and Vv, f{xi, v) ^ at 

(u), if ak < f {xj,u) and Vv, f {xi, v) ^ at 

(u), F^'‘ (u)} if ak < f {xj,u) and 3v,f {xi,v) = ak 

They give a DAG-decomposition for W_Max as follows: 

W_Max(5, /)(£)= min 0 

a.eAAa.=/(.„.) 


( 1 ) 


Hy represents the choice of the maximum cost component in the tuple, while G“ represents the choice of 
each component other than the one with the maximum weight. 



Figure 4: The DAG corresponding to W_Max 

The corresponding DAG {V,E) of the decomposition as shown in Figure 4 based on Example 3. The 
notation is the same as Figure 1. The DAG contains \V\ vertices, where \V\ = 0{nd ■ v?d) = 0{n^d‘^). 
By Theorem 6 and 7 by Allouche et al. (2015), the decomposition is safely DAG-decomposition. Moreover, 
the leaves are unary cost functions. The DAG-decomposition is polynomial and, by Theorem 9 by Allouche 
et al. (2015), the result follows. □ 

Example 4. Following Example 3, the sequence A is defined as: 

A = [oq) Q:i , <32) 0 : 3 ; <34) 0 : 5 ) 0!q] 

= [0, f{xi, l),f{x 2 , 2), /(xa, 2), /(xi, 3), /(xa, 3), /(x 2 ,4)] 

= [0,3,6,6,9,9,12] 
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The DAG-decomposition for W_Max can he represented as: 


W_Max(S',c)(£) = m i n { 


Hlie[x3])®Gf%£[xi])®G^%£[x2]), 

Hfi£[xi])(BGr{£[x2])(BG^^{£[x3]), 

Hli£[x3])(BGf^i£[xi])(BG^^i£[x2]), 

H^i£[x2])(BGf%£[xi])(BG^%£[x3]), 

HK£[xi])®G^^{£[x2])®G^^{£[x3]), 


> 


Assume £ = (1,2,3). We first compute the values of {Hf} and {Gf}, incrementally starting from ag. The 
results are shown in Table 1. 


Table 1: Computing the values of {Hf} and {G“} 





G“^ 

Gs^ 

Oq 

- 

T 

T 

T 

= f{xi, 1) = 3 

3 

0 

T 

T 

^2 = f{x2,2) = 6 

6 

0 

0 

T 

as = f{x3,2) = 6 

T 

0 

0 

0 

0:4 = f(xi,3) = 9 

T 

0 

0 

0 

0:5 = f(x3,3) = 9 

9 

0 

0 

0 

ae = f(x2,4) = 12 

T 

0 

0 

0 


WJsJlAx(S,c)(£) can he computed using Table 1, which gives the cost 9. 


W_Max( 5 ', c){£) = min < 


T ©0©0, 
T©0® 0, 
9©0® 0, 
T ©0® 0, 
6©0©T, 
3©T © T, 


> 


= 9 


Function WMaxMin in Algorithm 3 computes the minimum of a W_Max(S', c) cost function, based on 
Equation 1. The one for W_Min(S', c) is similar. The for-loop at line 5 tried every possible variable-value 
pair (xi,a) in the non-decreasing order of f{xi,a). At each iteration, it first computes the minimum among 
all tuple £ which £lxi] = v and it is the maximum weighted component in the tuple in line 7, and update the 
global minimum in line 8. The variables { 3 [a;i]} is then updated in line 9. They store the current minimum 
of {Gf}, which is used for compute the minimum among tuples with £lxi] is not the maximum weighted 
component. 

The time complexity is given by the theorem below. 

Theorem 6. Function WMaxMin in Algorithm 3 computes the minimum of W_Max(5', /), and it requires 
0(nd • \og(nd)), where n and d are defined in Theorem 2. 

Proof. Line 3 takes 0{nd ■ log(n(i)) to sort. The for-loop at line 5 iterates nd times. All operations in the 
iteration requires 0(1) except line 7. As it is, line 7 requires 0(n). By using special data structure like 
segment trees (Bentley, 1977), the time complexity can be reduced to 0(log(n)). The overall complexity 
becomes 0{nd ■ log(n(i) + nd ■ log(n)) = 0{nd ■ log(nc?)). □ 

Corollary 3. Given Ws = W_Max(5', /). The function findSupportf) in Algorithm 3 from Allouche 
et al. (2015) requires 0(nd^ ■ log(n(i)), where n and d is defined in Theorem 2. 

Proof. Follow directly from Property 2 in Allouche et al. (2015) and Theorem 6. □ 
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1 


2 

3 

4 

5 

6 

7 

8 
9 


Function WMaxMin(5, /) 

for i := 1 to n do g[xi] := T ; 
curMin := T ; 

A := {(xi,v) j Xi € S Av G B(x^)}; 
sort A in the nondecreasing order of f{xijv); 
foreach (xi^v) according to the sorted list A do 
a := f{xi,v); 

curCost := H^{v) © 9 [xj]; 

curMin := minjcurMin, curCost}; 
g[xi] := mm{g[xi],Gf{v)}; 
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return curMin ; 


Algorithm 3: Finding the minimum of W_Max 


3 Conclusion 

In this manuscript, we have shown that W_Among’'“’', W_Regular"“’' ,W_Max and W_Min, are polyno- 
mially DAG-decomposable. We also give the respective polytime dynamic programming based algorithms to 
compute the minimum of this class of global cost functions. 
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